sdoctor7 / bill-summarization

Text Summarization of Congressional Bills: Fall 2017 Capstone project, Columbia University Data Science Institute and Bloomberg
0 stars 1 forks source link

Parse original bills #2

Open xl2602 opened 6 years ago

xl2602 commented 6 years ago

I modified the bill parser. Now each record has 4 main categories: 'metadata', 'bill-type', 'body', 'file-name', and their sub-categories are separated by dot.

NOTE: It took longer than I expected, so I only focus on the body of, It should be sufficient now. When I have time, I will add

and other parts with more detailed explanation.