ClimateMisinformation / Scrapers

Web scrapers
5 stars 1 forks source link

Scrape the dailymail #4

Closed ricjhill closed 3 years ago

ricjhill commented 3 years ago

It is one of the most read online news resources in the UK. It is also popular in other countries. They have a text only website, so I will give it a go. It should be simple. I will adapt the existing scaper scripts.

https://www.dailymail.co.uk/textbased/channel-1/index.html

ricjhill commented 3 years ago

opened branch to deal with this.

alexn11 commented 3 years ago

Just realised that I forgot to comment about the tags: those comes from the source so could be anything really (they are not related to the GRIST/etc classes that we want to produce). If none is given it's probably better to leave it as an empty string.

ricjhill commented 3 years ago

merged feature branch so closing this ssue