LibraryOfCongress / api.congress.gov

congress.gov API
605 stars 38 forks source link

Hearing API data not fresh/accurate #171

Open dylanirlbeck opened 7 months ago

dylanirlbeck commented 7 months ago

Hi all. I have reason to believe that the /hearing endpoint is not returning fresh/accurate data for hearings. For example, this morning I curled the /hearing/senate endpoint to find the list of Senate hearings. I documented the result in a GitHub Gist.

Several upcoming and recent Senate hearings that I expected to be present in the dataset were not. Examples:

I'm going to switch to the /committee-meeting endpoint to see if I can get the data I need, but I would appreciate someone looking into the /hearing endpoint to see what's happening. Thanks!

dylanirlbeck commented 7 months ago

Is it possible that Senate hearings are simply not being coded correctly when they're entered into the system / scraped from senate.gov? All the Senate hearings are showing up as general committee meetings. This is in contrast to the House, which seems to better discriminate between hearings and meetings.

Screenshot 2023-11-10 at 12 28 01 PM Screenshot 2023-11-10 at 12 28 08 PM
104PL104 commented 7 months ago

Yes, you are correct that better structured data from the Senate is needed in order to differentiate between 'hearings' and 'business meetings', for example. We are in discussion with our Senate data partner about this requirement.

dylanirlbeck commented 6 months ago

@104PL104 @apreiter18 Any update on this? Thanks a bunch!

104PL104 commented 6 months ago

We do not yet have a timeline from our Senate data partner.