jasonasher / dc_doh_hackathon

Repository for the DC DOH Hackathon on September 23rd, 2017
5 stars 28 forks source link

Extract 'Rodent or Trash Violations' Feature from Restaurant Inspection Data #18

Open jasonasher opened 6 years ago

jasonasher commented 6 years ago

Start with the DC DOH Food Service Establishment Inspection report data in the /Data Sets/Restaurant Inspections/ folder in Dropbox.

Develop a script to extract the number of food establishment inspections that found rodent or trash-related violations (violations 38 or 54). More details on violations can be found here

Note that this issue depends upon the geocoding results from Issue #13

Input: CSV files with inspection summary and violation details

Output: A CSV file with

feature_id: The ID for the feature, in this case, "restaurant_violations_rodent_or_trash" feature_type: The establishment_type from the restaurant data set feature_subtype: The risk_category from 1-5 year: The ISO-8601 year of the feature value week: The ISO-8601 week number of the feature value census_block_2010: The 2010 Census Block of the feature value value: The value of the feature, i.e. the number of inspections that found rodent or trash-related violations in establishments with the given types and risk categories in the specified week, year, and census block.

When you are finished Submit a pull request on GitHub (or upload your scripts) Upload any files to Dropbox

Need more information? Flag Jason or Elizabeth, or ask your question in the comments below and we'll respond as soon as we can!

xavier-gutierrez commented 6 years ago

working on this

kelsonSS commented 6 years ago

Finished this. Uploading now

zacharyclement commented 6 years ago

Working on this now... I'll upload and post when finished

jasonasher commented 6 years ago

I recommend using the most recent data from dc_restaurant_inspections

That set contains a 'violation description' column as well as a 'violation number' to deal with the fact that violation numbers have different meanings over time.

zacharyclement commented 6 years ago

I finished this. I uploaded the files to dropbox.

eclee25 commented 6 years ago

Migrated this issue to codefordc/the-rat-hack repository as issue_13. The migrated issue simply checks the hackathon code and modifies it to run from the command line.