I modified the bill parser. Each record now has 4 top-level categories: 'metadata', 'bill-type', 'body', and 'file-name'. Sub-categories are separated from their parent category by a dot.
'metadata': the metadata from the original bill file
'bill-type': one of bill, resolution, or am
'body': 'body.section' contains all the text (for bills whose body has been modified, only the last version is kept)
'file-name': the file name only
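
To make the dot-separated layout concrete, here is a minimal sketch in Python. It assumes a record is a flat mapping whose keys look like 'category.sub-category'; that storage format is an assumption, not something confirmed above. Only the four top-level names and 'body.section' come from the description. Every other key, every value, and the helper group_by_category are hypothetical.

```python
from collections import defaultdict

# Hypothetical record: only the four top-level names and 'body.section'
# come from the description; all other keys and all values are made up.
record = {
    "metadata.title": "An example bill",   # hypothetical sub-category
    "metadata.sponsor": "Jane Doe",        # hypothetical sub-category
    "bill-type": "bill",
    "body.section": "Full text of the last version of the body.",
    "file-name": "example-bill.xml",       # hypothetical file name
}


def group_by_category(flat):
    """Group dot-separated keys under their top-level category."""
    grouped = defaultdict(dict)
    for key, value in flat.items():
        # Split on the first dot only; keys without a dot get sub == "".
        category, _, sub = key.partition(".")
        grouped[category][sub] = value
    return dict(grouped)


grouped = group_by_category(record)
print(sorted(grouped))
print(grouped["body"]["section"])
```

Categories with no sub-category, such as 'bill-type' and 'file-name' here, end up stored under the empty sub-key "" in this sketch.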
NOTE: This took longer than I expected, so I only focused on the body of
, it should be sufficient for now. When I have time, I will add