alan-turing-institute / misinformation-crawler

Web crawler to collect snapshots of articles to web archive
MIT License

Tab delimited article input files #362

Open dongpng opened 4 years ago

dongpng commented 4 years ago

Given earlier discussions, we thought it would be better to use tab delimiters. The released dataset therefore contains tab-delimited files (.tsv).

The 'Crawling a list of URLs' option still expects a .csv file. Can we support both comma-delimited (.csv) and tab-delimited (.tsv) files?
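
One possible approach, as a rough sketch only: pick the delimiter from the file extension before handing the file to Python's standard `csv` module. The function name `read_rows`, the `DELIMITERS` mapping, and the column names in the usage note are hypothetical and are not taken from the crawler's code, which may load its input differently.

```python
import csv
from pathlib import Path

# Hypothetical mapping from file extension to delimiter.
DELIMITERS = {".csv": ",", ".tsv": "\t"}


def read_rows(path):
    """Read a delimited file with a header row into a list of dicts.

    The delimiter is chosen from the file extension, so .csv and .tsv
    inputs can share the same loading code.
    """
    suffix = Path(path).suffix.lower()
    if suffix not in DELIMITERS:
        raise ValueError(f"Unsupported article list file type: {suffix!r}")
    # newline="" lets the csv module handle line endings itself.
    with open(path, newline="", encoding="utf-8") as handle:
        reader = csv.DictReader(handle, delimiter=DELIMITERS[suffix])
        return list(reader)
```

With this sketch, `read_rows("articles.tsv")` and `read_rows("articles.csv")` would both return one dict per row keyed by the header names (for example a hypothetical `article_url` column), and any other extension raises a `ValueError` rather than being silently parsed as comma-delimited.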