boredzo / ADEM-analyzer

BSD 3-Clause "New" or "Revised" License
0 stars 0 forks source link

Ingest posted results and analyze/compare them #4

Open boredzo opened 3 years ago

boredzo commented 3 years ago

If you have a copy of all the Assembly District pages with the posted results (including vote counts!) on them, it should be possible to compare those vote counts to our own counts of votes found in valid ballots.

This will likely involve using Beautiful Soup to scrape the HTML in each file.

boredzo commented 3 years ago

As described in the README, these pages should be expected in ./posted-results/ad-99.html (replace 99 with each AD number).

I would say this step should be optional; if posted-results is missing or empty, no biggie.

boredzo commented 3 years ago

It may also be a good idea to have a separate tool for converting scraped HTML to CSV.

boredzo commented 3 years ago

One step toward this happened in a1768e9868c8631fc4c8dd056f7375805c7d0135: analyze.py ingests the HTML pages and builds a CSV of the posted results. No comparison to independent scans yet, though.