Closed ryantmoore closed 5 years ago
This has been completed as of commit #a12c18044401775da6a7337c4438070b4fc37d25. Closing issue.
From lines 115-6, can you verify that
(tweets$screen_name %in% bots$user.screen_name)
will identify unique instances of tweets$screen_name
? That is, that there are no duplicates in tweets$screen_name
.tweets
and df_userbots
have the same observations in the same order?If either is not true, I think we need to refix the calculation.
With recent changes, this is handled differently and is now accurate.
Fixed in commit 97a4a1926b2337cafc94e67fbc716f44f73ff5ad
As of 9a94bb3, the user-level proportion defined in
botscan.R
at line 99 divides the number of bots identified in the full list of usernames by the number of unique usernames. I propose that this should be the number of unique bots identified divided by the number of unique usernames.Proposed fix:
Move line 86, used only in the calculation above,
within the
if(user_level){}
and structure as