Twitter Bot Detection

What is this Project about

In an Undergrad Research Project, we were tasked with creating a program to find bots injected in a large dataset of tweets by our peers.

What did we do

To find the bots, we implemented many features, such as clustering based on cosine similarity, removing profanity, detecting emojis and checking grammar mistakes. After a long time of deliberating and brainstorming with my team, we found that those features were the ones that stood out the most between bots and real users.

What results did we find

For the second dataset, we had the following results:

We have found: 58 bots

There are 64 users in the users_df

90.625% of the users in users_df are bots

89.23076923076924% bots have been found

For the first dataset, we had the following results:

We have found: 2 bots

There are 2 users in the users_df

100% of the users in users_df are bots

100% bots have been found

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
src		src
README.md		README.md
dataset.english.2024-03-22.augmented.B2.24.json		dataset.english.2024-03-22.augmented.B2.24.json
dataset.english.2024-03-22.bot_posts.B2.24.json		dataset.english.2024-03-22.bot_posts.B2.24.json
dataset.english.2024-03-22.bot_profiles.B2.2.json		dataset.english.2024-03-22.bot_profiles.B2.2.json
dataset.english.2024-03-22.users.augmented.B2.2.json		dataset.english.2024-03-22.users.augmented.B2.2.json
detection1.py		detection1.py
detection2.py		detection2.py
diff.json		diff.json
filtered_users_dataset1.txt		filtered_users_dataset1.txt
filtered_users_dataset2.txt		filtered_users_dataset2.txt
tweets.tsv		tweets.tsv
tweets_90_final.tsv		tweets_90_final.tsv
tweets_LI.tsv		tweets_LI.tsv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Twitter Bot Detection

What is this Project about

What did we do

What results did we find

For the second dataset, we had the following results:

For the first dataset, we had the following results:

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Twitter Bot Detection

What is this Project about

What did we do

What results did we find

For the second dataset, we had the following results:

For the first dataset, we had the following results:

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages