happyjack27 / autoredistrict

Programmatically makes a fair congressional district map (prevents gerrymandering)
GNU General Public License v3.0

better data source for election data? #3

Open happyjack27 opened 8 years ago

happyjack27 commented 8 years ago

one thing i've struggled with a lot is getting good clean data on vote counts at the voting tabulation district (vtd) level. i've been able to get clean vtd-level shapefiles and population data from census.gov, but election data seems to be all over the place, and it varies not only in format but also in quality. e.g. i've found some shapefiles with so many geometric areas that they are unusable.

would like a nice clean, centralized way to import this data, despite the apparent great variety in source, format, and quality of data depending on state.

carlschroedl commented 8 years ago

Oh brother, it's disappointing that the election data isn't available more consistently. At work we have a few apps that aggregate data from multiple independent organizations. It can be a lot of work to iron out the nuances of the different data providers. It always helps to have a machine-verifiable representation of what you would like in the end. So, for example, if you were using various XSLTs (one per independent data provider) to transform XML web service responses into a common format, it would be important to define an XSD for the common format, and perhaps some code that could perform higher-level validation logic on the transformed output. In this case, a good start would be pursuing the definition of a machine-verifiably "good" target.
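As a rough illustration of what a machine-verifiable "good" target could look like here, the sketch below checks normalized per-VTD records against a small set of rules. The field names (`geoid`, `dem_votes`, `rep_votes`, `total_votes`) and the rules themselves are assumptions made up for this example, not a format used by any data provider or by autoredistrict.

```python
# Hypothetical target schema: one record per voting tabulation district (VTD).
# These field names are illustrative assumptions, not taken from any real source.
REQUIRED_FIELDS = ("geoid", "dem_votes", "rep_votes", "total_votes")


def validate_rows(rows):
    """Return a list of human-readable problems; an empty list means the data passed."""
    problems = []
    seen = set()
    for i, row in enumerate(rows):
        missing = [f for f in REQUIRED_FIELDS if f not in row]
        if missing:
            problems.append(f"row {i}: missing fields {missing}")
            continue

        # The key must be present and unique so rows can be joined to geometry.
        geoid = row["geoid"]
        if not isinstance(geoid, str) or not geoid:
            problems.append(f"row {i}: geoid must be a non-empty string")
        elif geoid in seen:
            problems.append(f"row {i}: duplicate geoid {geoid}")
        else:
            seen.add(geoid)

        # Vote counts must be whole, non-negative, and internally consistent.
        counts = [row[f] for f in REQUIRED_FIELDS[1:]]
        if any(not isinstance(c, int) or c < 0 for c in counts):
            problems.append(f"row {i}: vote counts must be non-negative integers")
        elif row["dem_votes"] + row["rep_votes"] > row["total_votes"]:
            problems.append(f"row {i}: party votes exceed total votes")
    return problems
```

Each per-state importer could then be judged by the same yardstick: whatever the source format, its output should come back from `validate_rows` with an empty list.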

carlschroedl commented 8 years ago

I couldn't find anything relevant on a quick look at data.gov, but I suspect such a consolidated dataset would live in there if it exists. You might be able to supply better search terms.

happyjack27 commented 8 years ago

yeah, a consistent format would be nice. not a big fan of xml though, it's kludgy, bulky, inelegant... json is much better. and would rather just have it in a shapefile, or failing that, tab-delimited, keyed on geoid at the vtd-level, matching some standard datasource for the shapefile (such as census.gov)

i have two data sources in the "file" menu, the harvard election data archive and another one. they are kind of meant to be a central store, but unfortunately they're rather hit-or-miss on a number of levels. a bit disappointed. was expecting better quality from, you know, harvard.
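A minimal sketch of the tab-delimited option, assuming a header row with a key column named `geoid` and integer vote-count columns (both assumptions for illustration only). It loads the file into a dictionary keyed on geoid and then compares those keys against the geoids from a reference shapefile, which are passed in as a plain list here rather than read from an actual census.gov file.

```python
import csv
import io


def load_votes_by_geoid(tsv_text):
    """Parse tab-delimited text with a header row into {geoid: {column: int}}."""
    reader = csv.DictReader(io.StringIO(tsv_text), delimiter="\t")
    table = {}
    for row in reader:
        geoid = row.pop("geoid")  # assumed name of the key column
        table[geoid] = {col: int(val) for col, val in row.items()}
    return table


def compare_keys(votes, reference_geoids):
    """Report reference districts with no vote row, and vote rows matching no district."""
    reference = set(reference_geoids)
    vote_keys = set(votes)
    return {
        "missing_votes": sorted(reference - vote_keys),
        "unknown_geoids": sorted(vote_keys - reference),
    }
```

Keeping the geoid as a string rather than converting it to a number preserves leading zeros, which matters for states whose FIPS codes start with 0.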


carlschroedl commented 8 years ago

Yeah, sorry if I was unclear -- I'm not pushing for any particular serialization of the data, I was just drawing on an example that happens to use XML. I'm suggesting the pursuit of some sort of analog in the new context of this voting data.

happyjack27 commented 8 years ago

data sources for all but election data:

http://www.census.gov/rdo/pdf/StrengthInNumbers2010.pdf

https://www.census.gov/rdo/data/