sc3 / cook-convictions

The Tarbell project that generates Convicted in Cook
http://convictions.smartchicagoapps.org/
6 stars 1 forks source link

Export the data in publishable form #110

Closed Yana715 closed 9 years ago

Yana715 commented 10 years ago

Use a Google Spreadsheet for this as it allows for easy preview of the data as well as downloading in CSV (and other) formats.

ghing commented 10 years ago

I've implemented code to bake out a CSV and to anonymize the addresses. However, I found that the data is too big to import into Google Spreadsheets. I'm going to try to split by year and see if that works as I feel strongly that Google Spreadsheets is a good way to go to make the data more explorable. We can also offer a download of the CSV with all records.

I'm going to try to get this done today.

Bonus from this work: Once I get clear of the immediate needs of this project, I should be able to push Python 3 support upstream to datamade's awesome usaddress package.

ghing commented 10 years ago

Even when I slice the data by year (using the initial_date field) the largest file is ~25M which is still too big for Google Drive to convert into a spreadsheet. Raw CSV it is, but I think it still makes sense to upload it to Drive for revision control and overall ease of management.

ghing commented 10 years ago

I created a "Public Data" subfolder in Drive to hold the baked dispositions CSV.

ghing commented 9 years ago

This is done now. Closing.