openelections / openelections-data-tx

Pre-processed results for Texas elections
30 stars 27 forks source link

2014 General Precinct Results #11

Open dwillis opened 6 years ago

dwillis commented 6 years ago

Using Tabula, OCR or whatever method you can, parse precinct-level results for the following counties. Original sources are in the sources-tx repository.

The goal is to create a single CSV file for each county, with the following headers:

county, precinct, office, district, party, candidate, votes

If the county file also provides a breakdown of votes by method, include that using the following headers:

early_voting, election_day, provisional

Include the following offices:

The CSV files should be named 20141104__tx__general__{county}__precinct.csv. Here's an example finished file from 2016: https://github.com/openelections/openelections-data-tx/blob/master/2016/20161108__tx__general__sabine__precinct.csv.

timtrice commented 6 years ago

@dwillis , I have parsed Sabine county but have a couple of questions:

  1. Early votes are not broken down by precinct. The example shown above has an "early_votes" column per precinct. Based on the original post I have ignored this variable. Is this correct?

  2. Candidate is not broken down by party. Does this require manual lookup or is there a reference table available?

  3. What types of offices do "district" apply? I can easily parse "US Rep", "Member State Board of Education", "State Senator", and "State Rep". I did not include "Chief Justice, 12th Court of Appeals District", "District Judge, 273rd Judicial District".

dwillis commented 6 years ago

Hey @timtrice, thanks!

  1. Yeah, if Sabine doesn't have early votes broken out, you can skip that column (and election_day).
  2. You can use this to lookup parties: http://elections.sos.state.tx.us/elchist175_state.htm
  3. District applies to US Rep, State Senator and State Rep. You can skip the judicial races and board of education.
AngeloCioffi commented 6 years ago

Hey @dwillis! I'm working on the Fort Bend county data. I have a few questions:

  1. Do we want registered voters and ballots cast as values to the office field and not their own columns? If so, I'm confused on what one of those rows would look like.
  2. Fort Bend county has absentee ballot information. Is their anything we want to do with that information?
dwillis commented 6 years ago

Hi @AngeloCioffi! Thanks for your questions:

  1. Yes! The row would look like this one: https://github.com/openelections/openelections-data-tx/blob/master/2014/20141104__tx__general__marion__precinct.csv#L4
  2. Yes, if we have absentee votes, they would get a separate column like early votes or election day votes.