This repository contains the code to generate predictions of critical violations at food establishments in Chicago. It also contains the results of an evaluation of the effectiveness of those predictions.
Data file names now mirror the script names that created the files
Features on food inspections are now calculated separately
Features on business inspections are now calculated separately
The model code merges in the features, does not calculate features
Added script to adjust the public sanitarian data to match the schema of the private sanitarian file
More aggressive filtering functions
Separates out the violation matrix calculation into the parsing step and classification step (which, as it turns out will be useful for the new inspection format)
Refactoring model result / evaluation steps to accommodate future analysis
Data file names now mirror the script names that created the files
Features on food inspections are now calculated separately
Features on business inspections are now calculated separately
The model code merges in the features, does not calculate features
Added script to adjust the public sanitarian data to match the schema of the private sanitarian file
More aggressive filtering functions
Separates out the violation matrix calculation into the parsing step and classification step (which, as it turns out will be useful for the new inspection format)
Refactoring model result / evaluation steps to accommodate future analysis
Several changes in the code