kuriwaki / cvr_harvard-mit_scripts

6 stars 1 forks source link

[CA] Santa Clara County #272

Closed aconevska closed 4 months ago

aconevska commented 4 months ago

Recommendation: Use Harvard, though likely not ready for release.

Santa Clara County office counted 863,964 total ballots for 2020. The raw cvr.csv file has 1,401,218 rows due to fragmentation.

Santa Clara County office reports 850,741 total votes cast for the President. The Harvard processed file as 350,244 votes for President.

I am still trying to see whether we can get closer to the County office count. Its not clear why we're so far from the official count.

kuriwaki commented 4 months ago

The routine for me to figure out the "why off" has become:

  1. Is it just one vote method (eg mail) that is entirely missing everywhere?

  2. If not, is it entire precincts missing somewhere? You can use the parquet in the Dropbox (returns/) which is standardized the same way as CVRs, or you can get official precinct returns from the website

Fwiw.

mreece13 commented 4 months ago

MEDSL matches Harvard now, but they both still seem wildly off. I don't think it's worth it to invest any more time since we are missing so many votes, but perhaps something useful for the future. Closing for now.