Data4Democracy / media-crawler

Web scraper for generating a graph of media connections via articles, twitter, reddit, and more
31 stars 9 forks source link

Parse The Hill Articles #7

Open josephpd3 opened 7 years ago

josephpd3 commented 7 years ago

Using the WashingtonPost parser as an example, we want to create another parser for this source. Note: As of now, we only care to grab anchor tag <a> references.

This will involve a few things:

When submitting a PR for this, please include some sample references which you scraped from a source. We can work through cleaning it and getting it right if it comes down to it :)

brycecf commented 7 years ago

@josephpd3 I have this implemented. I'll make a pull request later on today.