sfbrigade / datasci-firerisk

This project attempts to model and acquire data from SF OpenData - and other sources - to predict the relative risk of fire in San Francisco’s buildings and public spaces.
http://codeforsanfrancisco.org/projects/SF-Fire-Risk-Project
10 stars 9 forks source link

Create feature data set from 'matched_Eviction_Notices.csv' #11

Open stahlerk opened 6 years ago

stahlerk commented 6 years ago

1) Subset data to potentially useful features 2) Detect and remove outliers 4) Collapse data at EAS level 5) Create any potentially relevant features 6) Any other data cleaning and standardization operations 7) Output as .csv (indexed at EAS)

sewardlee337 commented 6 years ago

Data Dictionary: https://data.sfgov.org/Housing-and-Buildings/Eviction-Notices/5cei-gny5