usgpo / bulk-data

User Guides for XML on the govinfo Bulk Data Repository. For information about Bill Status XML Bulk Data, see https://github.com/usgpo/bill-status.
https://www.govinfo.gov/bulkdata
266 stars 97 forks source link

Amendment and amended amendment activity in bulk data respository #27

Open jasonargoargo opened 6 years ago

jasonargoargo commented 6 years ago

What are your plans to add xml and zips of amendment activity to the bulk data repository, or add them into the existing 'BILLSTATUS' xml and zip files?

llaplant commented 6 years ago

Hi Jason, you may want to check out amendment activity in the BILLSTATUS bulk data at https://www.govinfo.gov/bulkdata/BILLSTATUS/115. You may also want to take a look at the BILLSTATUS repo for more information https://github.com/usgpo/bill-status. The User Guide is here https://github.com/usgpo/bill-status/blob/master/BILLSTATUS-XML_User_User-Guide.md and this is a link to the section on amendments https://github.com/usgpo/bill-status/blob/master/BILLSTATUS-XML_User_User-Guide.md#amendments.

jasonargoargo commented 6 years ago

Nice to meet you, Lisa! My program parses xml data. The problem with the amendments section of the BILLSTATUS files is they include everything... except the actual text of the amendment. Though we do get urls (that could be parsed from), they only lead to html, not xml.

The same goes for committee and floor votes. I can parse everything but the actual vote counts, who voted in what way, etc.

And finally, the same goes for CBO Cost Estimates though I imagine that is outside your purview.

I am available to speak over the phone if you need more context or if you would like me to help you in any way. I live and work here in DC.

jasonargoargo commented 6 years ago

May I get a status update on this issue?

llaplant commented 6 years ago

@jasonargoargo, this item is on our backlog; much of the source data is not yet available in XML.

jasonargoargo commented 6 years ago

I understand that you can't really give me an exact time frame for when this source data will be available for you to upload because you're depending on the deliverables of another office, but what else can be done to get this source data to you so that you can get it to us? What can I do to speed up the process?