ecprice / newsdiffs

Automatic scraper that tracks changes in news articles over time.

Fix BBC scraper heading and date #58

Closed tomwieck closed 2 years ago

tomwieck commented 6 years ago

Tested this locally and it seems to work correctly.

Worth noting that the files are spit out into one flat directory due to the BBC's URL structure (all articles come under /news/).
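As a rough illustration of why this happens, here is a minimal sketch that assumes output paths are derived directly from the article URL. The helper `url_to_path` and the example URLs are hypothetical and are not taken from the newsdiffs code.

```python
from urllib.parse import urlsplit
import posixpath


def url_to_path(url):
    """Hypothetical helper: map an article URL to a relative file path."""
    parts = urlsplit(url)
    # Use the host as the top-level directory, then the URL path beneath it.
    return posixpath.join(parts.netloc, parts.path.strip("/"))


# Made-up example URLs that follow the /news/<slug> pattern.
urls = [
    "http://www.bbc.co.uk/news/uk-12345678",
    "http://www.bbc.co.uk/news/world-europe-23456789",
]

paths = [url_to_path(u) for u in urls]
# Collect the distinct parent directories of the generated paths.
dirs = {posixpath.dirname(p) for p in paths}
print(dirs)
```

Since every slug sits directly under /news/, both paths share a single parent directory, whereas a site with dated or sectioned URLs would spread its files over several subdirectories.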

Also, it looks like story bodies for some AV articles are still not working.

To get it working locally, I also had to follow this fix: https://github.com/ecprice/newsdiffs/issues/25