alan-turing-institute / misinformation-crawler

Web crawler to collect snapshots of articles to web archive
MIT License
5 stars 2 forks source link

vanityfair.com extraction issues #329

Closed jemrobinson closed 5 years ago

jemrobinson commented 5 years ago

No articles being extracted. Errors are of the following form:

2019-07-22 12:17:38     INFO: Searching for an article at: https://www.vanityfair.com/news/2017/10/hillary-clinton-donald-trump-dossier
2019-07-22 12:17:38  WARNING: No elements could be found from https://www.vanityfair.com/news/2017/10/hillary-clinton-donald-trump-dossier matching //div[contains(@class, "article-main")] expected by match_rule 'single'. Returning None.