usgpo / bill-status

Information about Bill Status XML Bulk Data including the XML User Guide.
https://www.govinfo.gov/bulkdata/BILLSTATUS

Duplicate fields for amendments/amendment (113sconres8, amendment #710) #60

Open aih opened 7 years ago

aih commented 7 years ago

See https://www.gpo.gov/fdsys/bulkdata/BILLSTATUS/113/sconres/BILLSTATUS-113sconres8.xml. It has duplicate fields for the first listed amendment, including type, purpose, congress, and description.
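For anyone who wants to check other files for the same symptom, here is a minimal sketch of one way to spot repeated child elements under each `amendment`. The inline `SAMPLE` document and the function name `find_duplicate_fields` are made up for illustration; the real BILLSTATUS file has many more elements, and the `number` child used as a label is an assumption about the layout.

```python
import xml.etree.ElementTree as ET
from collections import Counter

# Hypothetical, heavily trimmed stand-in for a BILLSTATUS file.
SAMPLE = """<billStatus><bill><amendments>
  <amendment>
    <type>SAMDT</type><type>SAMDT</type>
    <purpose>p</purpose><purpose>p</purpose>
    <congress>113</congress><number>710</number>
  </amendment>
  <amendment>
    <type>SAMDT</type><congress>113</congress><number>711</number>
  </amendment>
</amendments></bill></billStatus>"""


def find_duplicate_fields(xml_text):
    """Return (amendment number, repeated child tags) for each affected amendment."""
    root = ET.fromstring(xml_text)
    report = []
    for amendment in root.iter("amendment"):
        # Count how often each direct child tag occurs in this amendment.
        counts = Counter(child.tag for child in amendment)
        dupes = sorted(tag for tag, n in counts.items() if n > 1)
        if dupes:
            report.append((amendment.findtext("number"), dupes))
    return report


print(find_duplicate_fields(SAMPLE))
```

Note that some children may legitimately repeat in the real schema, so a hit here is only a hint to look closer, not proof of a bug.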

llaplant commented 7 years ago

@aih Thank you for bringing this to our attention. Source data from the Library of Congress provides amendment type, purpose, congress, and description in two different places. We updated our processing to take these fields from only one of those places instead of both. The file you referenced in the issue, https://www.gpo.gov/fdsys/bulkdata/BILLSTATUS/113/sconres/BILLSTATUS-113sconres8.xml, has been reprocessed and updated.

aih commented 7 years ago

@llaplant Thanks!

llaplant commented 7 years ago

This morning we had to revert this change because it introduced a bug. I am reopening this issue.

aih commented 7 years ago

Maybe continue to check both sources and perform a duplicate check before updating billstatus? It's not the highest priority, since the data is not incorrect, just redundant.
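A rough sketch of what such a duplicate check could look like, assuming the output is built as an XML element tree. This is not GPO's actual processing; `drop_redundant_fields` and `SINGLE_VALUED` are hypothetical names. It removes a repeated field only when both the tag and the text match an earlier one, so values that differ between the two sources are kept for review.

```python
import xml.etree.ElementTree as ET

# Fields named in this issue as showing up twice (assumed to be single-valued).
SINGLE_VALUED = ("type", "purpose", "congress", "description")


def drop_redundant_fields(amendment, tags=SINGLE_VALUED):
    """Remove later children that exactly repeat an earlier (tag, text) pair."""
    seen = set()
    # Iterate over a copy because children are removed during the loop.
    for child in list(amendment):
        if child.tag not in tags:
            continue
        key = (child.tag, (child.text or "").strip())
        if key in seen:
            amendment.remove(child)  # exact repeat: redundant
        else:
            seen.add(key)  # first occurrence, or a conflicting value: keep
    return amendment


amendment = ET.fromstring(
    "<amendment><type>SAMDT</type><type>SAMDT</type>"
    "<purpose>a</purpose><purpose>b</purpose><number>710</number></amendment>"
)
drop_redundant_fields(amendment)
print([(c.tag, c.text) for c in amendment])
```

Keeping conflicting values rather than picking one means the check only strips redundancy and never discards data from either source.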