Gautamshahi / Misinformation_COVID-19

Dataset for analysing Propagation of COVID-19 Misinformation on Twitter
Creative Commons Zero v1.0 Universal
15 stars 12 forks source link

Scientific notation in tweet IDs #1

Closed gattilorenz closed 3 years ago

gattilorenz commented 3 years ago

I have noticed that at least in dataset 1 there is a problem with the tweet IDs, where they are sometimes written in scientific notation, e.g. 1.2316E+18 FALSE

Gautamshahi commented 3 years ago

Hi, It might be an issue of your editor. Which editor or tool are you using?

gattilorenz commented 3 years ago

grep, after downloading it from Github:

$ grep -n E+ * Dataset-I:339:1.25018E+18 FALSE Dataset-I:490:1.24021E+18 FALSE Dataset-I:859:1.2316E+18 FALSE

Thus lines 339, 490, 859 of Dataset-I

Gautamshahi commented 3 years ago

Thanks for pointing out, I have updated the dataset with more classes.

gattilorenz commented 3 years ago

Nice, thanks!