codeforboston / clean-slate-data

MIT License
27 stars 13 forks source link

Get PA data for additional years #10

Closed mikemahoney218 closed 4 years ago

mikemahoney218 commented 5 years ago

Our PA dataset was scraped from public-facing sources using code stored in a Jupyter notebook (which @sheldonchan has access to). We're hoping to find that notebook, in order to get PA data for additional years.

We should also quantify how many records from what timespan we are hoping to scrape.

sheldonchan commented 5 years ago

The year is hardcoded to 2018 in this, and also a separate script in the same repo was used to establish what the range of numbers were in order to do a random sample: https://github.com/EnMedina/Clean-Slate/blob/master/datascraper/RandomCaseGetter.py

jeremylang commented 4 years ago

No longer need a proxy state now that MA data is available.