Open boredzo opened 3 years ago
As described in the README, these pages should be expected in ./posted-results/ad-99.html (replace 99 with each AD number).
I would say this step should be optional; if posted-results is missing or empty, no biggie.
It may also be a good idea to have a separate tool for converting scraped HTML to CSV.
One step toward this happened in a1768e9868c8631fc4c8dd056f7375805c7d0135: analyze.py ingests the HTML pages and builds a CSV of the posted results. No comparison to independent scans yet, though.
If you have a copy of all the Assembly District pages with the posted results (including vote counts!) on them, it should be possible to compare those vote counts to our own counts of votes found in valid ballots.
This will likely involve using Beautiful Soup to scrape the HTML in each file.