The discrepancies in the Harvard dataset arise from the same issue described in https://github.com/kuriwaki/cvr_harvard-mit_scripts/issues/239. Our initial parse of the JSON CVR files reproduces the vote for president exactly. Once Jim rebuilds the long dta and @kuriwaki then rebuilds the parquet, we should be good to go.
I am closing issues that are only #fix-harvard (but MEDSL is correct) as "not planned" for now. The Harvard team should revisit them later by using the hashtag.
The discrepancies in the Harvard dataset arise from the same issue described in https://github.com/kuriwaki/cvr_harvard-mit_scripts/issues/239. Our initial parse of the JSON CVR files reproduces the vote for president exactly. Once Jim rebuilds the long dta and @kuriwaki then rebuilds the parquet, we should be good to go.