Open waldoj opened 6 years ago
Right now we're scraping every summary. The legislature has started publishing Summaries.csv on their FTP site, with these headers:
Summaries.csv
"SUM_BILNO","SUMMARY_DOCID","SUMMARY_TYPE","SUMMARY_TEXT"
Get all summaries from this file, and stop scraping them with summaries.php.
summaries.php
Hang on—I'm dubious that the hash-checking system is working here.
Right now we're scraping every summary. The legislature has started publishing
Summaries.csv
on their FTP site, with these headers:Get all summaries from this file, and stop scraping them with
summaries.php
.