digitalepidemiologylab / covid-twitter-bert

Pretrained BERT model for analysing COVID-19 Twitter data
MIT License
184 stars 27 forks source link

COVID Category (CC) dataset has 4 invalid formats for Tweet ID #6

Closed ogencoglu closed 4 years ago

ogencoglu commented 4 years ago

For example line 754 is: 1.22065E+18,category_news

This breaks the download (hydration) unfortunately.

mar-muel commented 4 years ago

Thanks for bringing this to our attention! The 4 IDs had been accidentally converted to float format. Should work now!