usgpo / bill-status

Information about Bill Status XML Bulk Data including the XML User Guide.
https://www.govinfo.gov/bulkdata/BILLSTATUS

recordedVotes xml links look badly formatted (missing 0 padding) #240

Open jerusalemmoorework opened 1 week ago

jerusalemmoorework commented 1 week ago

While processing bulk bill data and the associated voting record data, I noticed a failure caused by a badly formatted link in some entries.

For example, this entry: https://www.govinfo.gov/bulkdata/BILLSTATUS/118/hr/BILLSTATUS-118hr400.xml

includes this link in a recordedVotes element: https://clerk.house.gov/evs/2023/roll35.xml

That link is dead; it looks like it should be https://clerk.house.gov/evs/2023/roll035.xml

It looks like the roll number in these links should be zero-padded, but the padding is missing in entries like https://www.govinfo.gov/bulkdata/BILLSTATUS/118/hr/BILLSTATUS-118hr400.xml
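
In case it helps anyone hitting the same failure, here is a minimal client-side workaround sketch. The function name `pad_roll_url` is hypothetical, and the three-digit width is an assumption inferred only from the `roll035.xml` example above, not from any documented House Clerk rule.

```python
import re

# Matches the roll-call file name at the end of a House Clerk EVS URL,
# e.g. ".../evs/2023/roll35.xml".
_ROLL_RE = re.compile(r"(/evs/\d{4}/roll)(\d+)(\.xml)$")


def pad_roll_url(url: str, width: int = 3) -> str:
    """Return url with the roll number zero-padded to `width` digits.

    Assumption: three digits is the expected width (inferred from the
    roll035.xml example). URLs that do not match the pattern, or that
    are already padded, are returned unchanged.
    """
    return _ROLL_RE.sub(
        lambda m: f"{m.group(1)}{m.group(2).zfill(width)}{m.group(3)}",
        url,
    )


fixed = pad_roll_url("https://clerk.house.gov/evs/2023/roll35.xml")
```

This only rewrites the URL string; it does not check that the padded link actually resolves.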

llaplant commented 1 week ago

Issue is present in Congress.gov API response data. GPO is coordinating with the Library of Congress Congress.gov Team. See also https://github.com/LibraryOfCongress/api.congress.gov.

apreiter18 commented 6 days ago

The Library has completed a fix for this situation in the Congress.gov API - https://api.congress.gov/v3/bill/118/hr/400/actions?api_key=[INSERT_KEY]

Thanks!

llaplant commented 6 days ago

Verified fix in https://www.govinfo.gov/bulkdata/BILLSTATUS/118/hr/BILLSTATUS-118hr400.xml. GPO will plan a reprocess of 118 BILLSTATUS to pick up fixes to any additional 118 BILLSTATUS files.
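
For anyone who wants to check their own downloaded copies for leftover unpadded links, a rough sketch follows. It assumes the links sit in a `url` child of each `recordedVote` element and that any House roll-call link with fewer than three digits is unpadded; both are assumptions for illustration, and `unpadded_vote_urls` and the inline `SAMPLE` document are hypothetical, not real BILLSTATUS data.

```python
import re
import xml.etree.ElementTree as ET

# Assumption: a House roll-call link with only one or two digits is unpadded.
UNPADDED = re.compile(r"/evs/\d{4}/roll\d{1,2}\.xml$")


def unpadded_vote_urls(xml_text: str) -> list[str]:
    """Return recordedVote URLs whose roll number looks unpadded."""
    root = ET.fromstring(xml_text)
    urls = []
    # Assumed layout: <recordedVote> elements each carrying a <url> child.
    for vote in root.iter("recordedVote"):
        url = (vote.findtext("url") or "").strip()
        if UNPADDED.search(url):
            urls.append(url)
    return urls


# Hypothetical, heavily trimmed stand-in for a BILLSTATUS file.
SAMPLE = """<billStatus><bill><actions><item><recordedVotes>
<recordedVote><url>https://clerk.house.gov/evs/2023/roll35.xml</url></recordedVote>
<recordedVote><url>https://clerk.house.gov/evs/2023/roll035.xml</url></recordedVote>
</recordedVotes></item></actions></bill></billStatus>"""

leftover = unpadded_vote_urls(SAMPLE)
```

Running this over each file after the reprocess should give an empty list if every link was fixed.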

jerusalemmoorework commented 6 days ago

Thanks team