Open ottofabian opened 6 years ago
Crawled data for domains: Sports [Celebrities] Broadcast Star Actors Politics FunMix (Contains various Twitterprofiles that may not match threshhold criteria, but could be interesting anyways. E.g. Pope Francis, Dalai Lama, Lord Voldemort etc.)
Each domain contains at least 20 individual accounts, with 1000-1100 Tweets each. So 5 Categories a 20 authors x 1000 Tweets = at least 100.000 Tweets. Will Push soon
The goal should be to get 2 - 4 more domains in order to check if the Authorship identification also works for different domains of "Tweeters". Possible ideas: