unitedstates / wish-list

A wish list for this organization, open an Issue to discuss what we can add. Derived from a News Foo session.
https://github.com/unitedstates/wish-list/issues
16 stars 3 forks source link

Committee scraper #11

Open schmod opened 11 years ago

schmod commented 11 years ago

A lot of my work revolves around committee hearings, and I've noticed that GovTrack, Sunlight, NYT, and the other usual suspects only track committee activity at the most basic levels. It seems like this very important part of our legislative process gets routinely ignored.

It'd be pretty awesome to have a scraper that aggregated all committee-related activity into an easily-digestible data source.

Right now, the data is tied up in fdsys, committee sites, and the daily digest. There's absolutely no good way to automatically cross-reference any of it.

I don't have a specific proposal to put forward, but a congress-committees repository might be a valuable thing. A lot of the groundwork has already been done in congress-legislators.

Things we might want to aggregate:

Again, this isn't a concrete proposal, but it's something that I've been tossing around in the back of my head.

konklone commented 11 years ago

I'm a huge proponent of committee information. Committee voting records, in particular, never surface in any free data source, because of how difficult they are to collect.

You've described really well the disparate nature of the data - which is why it hasn't been done yet. Is this something you'd be interested in leading the way on?

I'll note that the House's new Committee Repository has a lot of data now, and a lot of potential. It may have some of what you've asked for above, for individual hearings. The Clerk has also been explicit about growing its scope to include here-to-fore unpublished or decentralized information, such as voting records.