Open thwhitfield opened 4 years ago
Fire quality scores have been applied to fire station IDs using the code in thw_1.1_fire_station_EDA.ipynb. Once the NFIRS dataset is updated (https://github.com/DataKind-DC/rcp2/issues/27 - NFIRS Fire Incident Data.csv - Investigate Null Values), that notebook should be re-ran to get fire quality scores for each Fire station ID
Data sources: NFIRS, ARC Response
[ ] Fire Reporting Quality Assessment Step 1 - Determine statistical outliers for NFIRS reporting at the County level using year-over-year and month-over-month reporting. Assign confidence score for county reporting consistancy. Take into account "zero" reporters. | NFIRS
[ ] Fire Reporting Quality Assessment Step 2 - Determine possible outliers for NFIRS reporting at the county level by looking at annual ARC reported totals and comparing to NFIRS. NFIRS should be greater | NFIRS, ARC Response
[ ] Fire Reporting Quality Assessment Step 3 - Determine possible NFIRS reporting deficiencies for census tracts, e.g. FD reports fire consistently, but only in 5 of county's 10 tracts. Determine best way to interpret data. If possible, assign a census tract reporting score to county, e.g. percentage of tracts where fires were reported based on aggregate data.