kuriwaki / cvr_harvard-mit_scripts

6 stars 1 forks source link

[CA] Kings #321

Closed aconevska closed 2 months ago

aconevska commented 2 months ago

Recommendation: Use MEDSL, also check Kings in H "01_direct-to-parquet-loop.R" code.

MEDSL has no issues for this county but the processed Harvard data is short 26 votes for president (cvr_status sheet reports 22 but I count 23 missing for Trump and 3 missing for Biden). Note that this is only in the Harvard processed parquet - i.e. after running "01_direct-to-parquet-loop.R" and "02_merge-party_snyder.R".

Jim's "CA_Kings_long.dta" file has the exact same count as the SOV, like MEDSL. And I find the same counts as the SOV in the raw "cvr.csv" with basically no cleaning.

I'm thinking this might be related to the same issues we find in Contra Costa and Sonoma currently, possibly an issue with the way Harvard is deduping. (https://github.com/kuriwaki/cvr_harvard-mit_scripts/issues/296#issuecomment-2197357404) (https://github.com/kuriwaki/cvr_harvard-mit_scripts/issues/300)

kuriwaki commented 2 months ago

I am closing issues that are only #fix-harvard (but MEDSL is correct) as "not planned" for now. The Harvard team should revisit them later by using the hashtag.