Open carlsopa opened 6 years ago
@carlsopa No one is working on this data, so it's wide-open if you would like to, thanks! I think your judgment about this being a good candidate for Tabula is correct; you could use Tabula to parse it and then re-arrange the results either manually or via a script.
Interested into looking at the data for the 2004 election. I have a few questions/concerns with this: 1) is anyone currently working on this data set? 2) the data set is in .pdf form. The data columns have no discerning breaks within the text data. I feel the easiest way to access and clean the data up would be to manually turn the file into a .csv file using tabula and scrape that file.