2022 Primary Precinct Results

dwillis commented 2 years ago

Using Tabula, OCR or whatever method you can, parse precinct-level results for the following counties. Original sources are in the sources-tx repository.

The goal is to create a single CSV file for each county, with the following headers:

county, precinct, office, district, party, candidate, votes

If the county file also provides a breakdown of votes by method, include that using the following headers:

early_voting, election_day, provisional, mail

If there are other possible vote types, include them, using a lowercase version of the vote type with underscores instead of spaces for the column name.

Include the following offices:

Registered Voters (if available)
Ballots Cast (if available)
Governor
Lieutenant Governor
Attorney General
Comptroller of Public Accounts
Commissioner of the General Land Office
Commissioner of Agriculture
Railroad Commissioner
U.S. House
State Senate
State Representative

If a county provides precinct results for Write-in candidates, they should be grouped in a single row for each precinct and office with a candidate value of Write-ins.

If a county provides Under Votes or Over Votes, those should be recorded in the same way, with a single row per precinct and office with Over Votes and Under Votes as the candidate values.

[x] Anderson County
[ ] Andrews County
[ ] Angelina County
[ ] Aransas County
[ ] Archer County
[ ] Armstrong County
[ ] Atascosa County
[x] Austin County
[x] Bailey County
[x] Bandera County
[ ] Bastrop County
[ ] Baylor County
[ ] Bee County
[ ] Bell County
[ ] Bexar County
[ ] Blanco County
[ ] Borden County
[ ] Bosque County
[ ] Bowie County
[ ] Brazoria County
[ ] Brazos County
[ ] Brewster County
[ ] Briscoe County
[ ] Brooks County
[ ] Brown County
[ ] Burleson County
[ ] Burnet County
[ ] Caldwell County
[x] Calhoun County
[ ] Callahan County
[x] Cameron County
[ ] Camp County
[x] Carson County
[ ] Cass County
[ ] Castro County
[ ] Chambers County
[ ] Cherokee County
[ ] Childress County
[x] Clay County
[ ] Cochran County
[ ] Coke County
[x] Coleman County
[ ] Collin County
[ ] Collingsworth County
[ ] Colorado County
[ ] Comal County
[ ] Comanche County
[ ] Concho County
[ ] Cooke County
[ ] Coryell County
[ ] Cottle County
[ ] Crane County
[ ] Crockett County
[ ] Crosby County
[ ] Culberson County
[x] Dallam County
[ ] Dallas County
[ ] Dawson County
[ ] Deaf Smith County
[ ] Delta County
[ ] Denton County
[ ] DeWitt County
[ ] Dickens County
[ ] Dimmit County
[x] Donley County
[ ] Duval County
[ ] Eastland County
[ ] Ector County
[ ] Edwards County
[x] Ellis County
[ ] El Paso County
[ ] Erath County
[ ] Falls County
[ ] Fannin County
[ ] Fayette County
[ ] Fisher County
[ ] Floyd County
[ ] Foard County
[ ] Fort Bend County
[ ] Franklin County
[ ] Freestone County
[ ] Frio County
[ ] Gaines County
[ ] Galveston County
[ ] Garza County
[ ] Gillespie County
[ ] Glasscock County
[x] Goliad County
[ ] Gonzales County
[ ] Gray County
[ ] Grayson County
[ ] Gregg County
[ ] Grimes County
[ ] Guadalupe County
[ ] Hale County
[ ] Hall County
[ ] Hamilton County
[ ] Hansford County
[ ] Hardeman County
[ ] Hardin County
[ ] Harris County
[ ] Harrison County
[x] Hartley County
[ ] Haskell County
[ ] Hays County
[x] Hemphill County
[ ] Henderson County
[ ] Hidalgo County
[ ] Hill County
[ ] Hockley County
[ ] Hood County
[ ] Hopkins County
[ ] Houston County
[ ] Howard County
[ ] Hudspeth County
[ ] Hunt County
[ ] Hutchinson County
[ ] Irion County
[ ] Jack County
[ ] Jackson County
[ ] Jasper County
[ ] Jeff Davis County
[ ] Jefferson County
[ ] Jim Hogg County
[ ] Jim Wells County
[ ] Johnson County
[x] Jones County
[ ] Karnes County
[ ] Kaufman County
[ ] Kendall County
[ ] Kenedy County
[ ] Kent County
[ ] Kerr County
[ ] Kimble County
[ ] King County
[ ] Kinney County
[ ] Kleberg County
[ ] Knox County
[ ] Lamar County
[x] Lamb County
[ ] Lampasas County
[ ] La Salle County
[ ] Lavaca County
[ ] Lee County
[x] Leon County
[ ] Liberty County
[x] Limestone County
[ ] Lipscomb County
[ ] Live Oak County
[ ] Llano County
[ ] Loving County
[ ] Lubbock County
[ ] Lynn County
[ ] McCulloch County
[ ] McLennan County
[ ] McMullen County
[ ] Madison County
[ ] Marion County
[ ] Martin County
[ ] Mason County
[ ] Matagorda County
[ ] Maverick County
[ ] Medina County
[ ] Menard County
[ ] Midland County
[ ] Milam County
[ ] Mills County
[ ] Mitchell County
[ ] Montague County
[ ] Montgomery County
[x] Moore County
[ ] Morris County
[ ] Motley County
[ ] Nacogdoches County
[ ] Navarro County
[ ] Newton County
[x] Nolan County
[ ] Nueces County
[ ] Ochiltree County
[x] Oldham County
[x] Orange County
[ ] Palo Pinto County
[ ] Panola County
[ ] Parker County
[ ] Parmer County
[ ] Pecos County
[ ] Polk County
[ ] Potter County
[ ] Presidio County
[ ] Rains County
[ ] Randall County
[ ] Reagan County
[ ] Real County
[ ] Red River County
[ ] Reeves County
[ ] Refugio County
[ ] Roberts County
[x] Robertson County
[ ] Rockwall County
[ ] Runnels County
[ ] Rusk County
[ ] Sabine County
[ ] San Augustine County
[ ] San Jacinto County
[ ] San Patricio County
[ ] San Saba County
[ ] Schleicher County
[ ] Scurry County
[ ] Shackelford County
[ ] Shelby County
[ ] Sherman County
[ ] Smith County
[ ] Somervell County
[ ] Starr County
[ ] Stephens County
[ ] Sterling County
[ ] Stonewall County
[ ] Sutton County
[x] Swisher County
[ ] Tarrant County
[ ] Taylor County
[ ] Terrell County
[ ] Terry County
[ ] Throckmorton County
[ ] Titus County
[ ] Tom Green County
[x] Travis County
[ ] Trinity County
[ ] Tyler County
[ ] Upshur County
[ ] Upton County
[ ] Uvalde County
[ ] Val Verde County
[ ] Van Zandt County
[ ] Victoria County
[ ] Walker County
[ ] Waller County
[ ] Ward County
[ ] Washington County
[ ] Webb County
[ ] Wharton County
[ ] Wheeler County
[ ] Wichita County
[ ] Wilbarger County
[ ] Willacy County
[ ] Williamson County
[ ] Wilson County
[ ] Winkler County
[ ] Wise County
[ ] Wood County
[ ] Yoakum County
[ ] Young County
[ ] Zapata County
[ ] Zavala County

thefuturewasnow commented 2 years ago

@dwillis I'd love to help but not sure where to start. I was looking at this file for Anderson County and not sure how to parse the first set of numbers. I'm sure others will do this quickly and/or better, but felt like I could contribute a few hours if it's helpful. Thanks for all the work you do!

dwillis commented 2 years ago

@thefuturewasnow thanks! Yeah, those txt files can be tricky, so I might suggest something a bit easier like maybe Bee County, which basically involves reworking a CSV file into a slightly different format. You can see the result we want by looking at files I've converted for Austin, Bailey and Bandera counties in the 2022 folder.

ssdatar commented 2 years ago

I can get some of this, too. Is it okay if I add a code directory within the 2022 directory if I wrote a Python script?

dwillis commented 2 years ago

@ssdatar thanks! we've got a python-parsers directory in the base directory of the repository if that works?

ssdatar commented 2 years ago

Yep, I'll add it to the python-parsers directory.

ssdatar commented 2 years ago

@dwillis Sorry, quick question. I found some counties are in this format. Are these '000' separated?

I can see that in some counties, the last digits in the first column are the vote totals. Is that accurate? Just trying to understand the organization. If there's any documentation or a parser already in the repo that works with these kinds of files, please let me know. Thanks!

dwillis commented 2 years ago

@ssdatar no worries! these are actually fixed-width files, not delimited by '000' or anything. Depending on the number of votes-related columns, we've got multiple parsers for them in the python-parsers directory that begin with asc. I've been working my way through some of those.

rdavis27 commented 2 years ago

@dwillis - I think that I've figured out the general parsing and am working on a short R program to do that. Is it possible that we may want to have an R-parsers directory as well as a python-parsers directory? I know that Python is more the standard but both languages are similar and in heavy use. In any case, I've run into a few issues. One is that there's a lack of standardization in the input file names. I have a list of the Texas counties so I could possibly cycle through that list and read the files that begin with the county name. Each county should have one with "DEMOCRATIC" in it and one with "REPUBLICAN" in it. That's assuming that the names follow that standard. But then I noticed that some of the counties are missing. Specifically, Harris and Maverick Counties are missing but there are likely others.

All of this suggests that it would be best if I could download the entire directory so that I could run the program on it. Is there a simple way to do that? I tried to clone the Texas repository but it was too huge for the free space on my drive. However, I also realized that I might need to do that if I am going to check anything into the repository at some point. Do we have a document that describes how we should do these basic operations on this repository? Thanks.

openelections / openelections-data-tx

2022 Primary Precinct Results #430