Closed aconevska closed 4 months ago
The routine for me to figure out the "why off" has become:
Is it just one vote method (eg mail) that is entirely missing everywhere?
If not, is it entire precincts missing somewhere? You can use the parquet in the Dropbox (returns/) which is standardized the same way as CVRs, or you can get official precinct returns from the website
Fwiw.
MEDSL matches Harvard now, but they both still seem wildly off. I don't think it's worth it to invest any more time since we are missing so many votes, but perhaps something useful for the future. Closing for now.
Recommendation: Use Harvard, though likely not ready for release.
Santa Clara County office counted 863,964 total ballots for 2020. The raw cvr.csv file has 1,401,218 rows due to fragmentation.
Santa Clara County office reports 850,741 total votes cast for the President. The Harvard processed file as 350,244 votes for President.
I am still trying to see whether we can get closer to the County office count. Its not clear why we're so far from the official count.