unitedstates / inspectors-general

Collecting reports from Inspectors General across the US federal government.
https://sunlightfoundation.com/blog/2014/11/07/opengov-voices-opening-up-government-reports-through-teamwork-and-open-data/
Creative Commons Zero v1.0 Universal
107 stars 21 forks source link

Are there any web scrapers that need to be written? #224

Closed jackiekazil closed 9 years ago

jackiekazil commented 9 years ago

Are there any web scrapers that need to be written for this repo?

@Sisiwei and I are teaching a web scraping class @ Pycon and we are looking for possible challenges for folks to dig into.

konklone commented 9 years ago

Our current scope is the 70+ federal IGs under the scope of the Council of Inspectors General, that publish reports online in a scrape-able place. We also have the House of Representatives IG in there.

There are a few places we could happily expand to cover more areas of oversight, I think.

Additionally, you could look over the IGs for whom we don't have any reliable report locations, and either verify that this is the case, or identify some reputable third party sources where we might find some. Those IGs are:

We'd also love it if people could identify any room for improvement in existing scrapers, or felt moved to tackle any of the open issues.

Thanks for inquiring, and for being interested in the project for your class! This was a useful thing to write down.

jackiekazil commented 9 years ago

@konklone this helpful! Thank you!