KaiDMML / FakeNewsNet

This is a dataset for fake news detection research.

Size of dataset #25

Open SaschaStenger opened 4 years ago

SaschaStenger commented 4 years ago

Hello

I have a question about downloading the dataset. Since downloading is very slow for me, I would like to know how far along I am in the process. So far I am downloading the politifact part of the dataset, including tweets and retweets. The part I have already downloaded currently takes up 7.1 GB on disk. Does anyone know how big the final dataset will be?
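For reference, the on-disk figure can be measured with a short script like the one below. This is only a minimal sketch: the path `fakenewsnet_dataset/politifact` is a hypothetical placeholder for wherever the download is written, and the helper names `dir_size_bytes` and `format_gb` are made up for illustration, not part of the FakeNewsNet code.

```python
import os


def dir_size_bytes(root):
    """Sum the sizes of all regular files under root (symlinks are skipped)."""
    total = 0
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            if os.path.islink(path):
                continue
            try:
                total += os.path.getsize(path)
            except OSError:
                # A file may disappear or be rewritten while the download runs.
                pass
    return total


def format_gb(num_bytes):
    """Format a byte count in decimal gigabytes (1 GB = 10**9 bytes)."""
    return f"{num_bytes / 1e9:.1f} GB"


if __name__ == "__main__":
    # Hypothetical location; adjust to the directory the download writes to.
    root = "fakenewsnet_dataset/politifact"
    print(format_gb(dir_size_bytes(root)))
```

Running it periodically shows how fast the directory grows, but it cannot say how much is still missing without knowing the final size.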

rlleshi commented 4 years ago

@SaschaStenger So how big was it? I am currently at 8.5 GB and counting...

SaschaStenger commented 4 years ago

I have been restructuring the download a little for my own purposes. I managed to download all the tweets for both sources within a day, but the retweets are still giving me some trouble. I can tell you that the tweets took up 6 GB of disk space, and that is for politifact and gossipcop together. I haven't managed to get the retweet download running in a satisfactory way so far, but I'll let you know once I do.

rlleshi commented 4 years ago

Thanks!