alan-turing-institute / misinformation-crawler

Web crawler to collect snapshots of articles to web archive
MIT License

Fix missing bylines christianpost #316

Closed by edwardchalstrey1 5 years ago

edwardchalstrey1 commented 5 years ago

Added a second test. The crawler now reports no articles with missing bylines in a run of up to ~3000 pages.

The previous byline XPath was over-specific.
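To illustrate what "over-specific" can mean here, a minimal sketch follows. The markup, the two XPath expressions, and the `extract_byline` helper are all hypothetical and are not taken from christianpost or from this repository's site config. It uses the standard library's `xml.etree.ElementTree` rather than the crawler's own selector code, so it only shows the general failure mode: an expression that pins every ancestor element stops matching as soon as a page uses a slightly different layout.

```python
import xml.etree.ElementTree as ET

# Hypothetical article markup: two layouts that both carry an author link.
PAGE_A = (
    "<html><body><div class='article'><div class='meta'>"
    "<p class='byline'><a class='author'>Jane Doe</a></p>"
    "</div></div></body></html>"
)
PAGE_B = (
    "<html><body><div class='article'><header>"
    "<span class='byline'><a class='author'>John Roe</a></span>"
    "</header></div></body></html>"
)

# Hypothetical expressions. The first pins the full ancestor chain,
# the second only requires an <a class="author"> anywhere in the page.
OVER_SPECIFIC = "./body/div/div[@class='meta']/p[@class='byline']/a"
GENERAL = ".//a[@class='author']"


def extract_byline(page, xpath):
    """Return the stripped text of the first match, or None if nothing matches."""
    root = ET.fromstring(page)
    node = root.find(xpath)
    if node is None or not node.text:
        return None
    return node.text.strip()


for name, page in (("A", PAGE_A), ("B", PAGE_B)):
    print(name, extract_byline(page, OVER_SPECIFIC), extract_byline(page, GENERAL))
```

With these made-up pages, the over-specific expression finds a byline only on layout A, while the general one finds it on both. A looser expression can also match unwanted elements, so a test per known layout is a sensible guard.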

Fixes #315