marcdotson / counting-cockroaches

Using social media to assess the severity of service failures.
MIT License
3 stars 0 forks source link

Label Tweets as Complaints #10

Closed marcdotson closed 5 years ago

marcdotson commented 5 years ago

We need to create a test dataset by labeling 5000 tweets as complaints. We'll use this to validate and compare different classification techniques.

Adriel has the sheet -- just post a link here for the shared document.

marcdotson commented 5 years ago

Note we might need to include a separate column as "maybe" for tweets that aren't obviously complaints.

AdrielC commented 5 years ago

https://docs.google.com/spreadsheets/d/1rU3Gt81fwjHAcB0-a0N3rwsfquKQJjxNK838lhsCDCg/edit#gid=65146049

Here is a link to all of our labeled tweets. 4960 labeled tweets in total.

marcdotson commented 5 years ago

Let's put this data on the repo.

marcdotson commented 5 years ago

Do we need to build a better test data set? Let's get it in the Data directory. @AdrielC @dallin-cardon