codeforboston / clean-slate-data

Does the PA dataset have the same crime rates as the FBI data? #8

Closed mikemahoney218 closed 4 years ago

mikemahoney218 commented 5 years ago

PA was selected for this analysis because (a) it has more open privacy laws and (b) in an FBI dataset, PA's crime rates were the most similar to MA's.

We need to validate that (b) still holds in our detailed dataset! Since we don't have access to a detailed dataset for MA (or else we'd use it instead), we have to make sure the detailed PA dataset is close to the FBI summarized data.

This is a child issue of https://github.com/codeforboston/clean-slate/issues/6
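
A rough sketch of what that check could look like, in case it helps scope the work. It compares the offense mix (each offense's share of all records) between the detailed dataset and the FBI summary, because population-normalized rates would need population figures that are not discussed here. The function names, offense categories, and counts are all hypothetical placeholders, not real PA or FBI figures.

```python
def offense_shares(counts):
    """Convert raw offense counts into each offense's share of the total."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no records to compare")
    return {offense: n / total for offense, n in counts.items()}


def max_share_gap(detailed_counts, summary_counts):
    """Largest absolute difference in offense share between the two sources."""
    detailed = offense_shares(detailed_counts)
    summary = offense_shares(summary_counts)
    offenses = set(detailed) | set(summary)
    # An offense missing from one source counts as a share of 0 there.
    return max(abs(detailed.get(o, 0.0) - summary.get(o, 0.0)) for o in offenses)


# Made-up counts for illustration only (assumption: not real data).
detailed_pa = {"assault": 50, "burglary": 30, "larceny": 20}
fbi_pa = {"assault": 450, "burglary": 350, "larceny": 200}

gap = max_share_gap(detailed_pa, fbi_pa)
print(f"largest share gap: {gap:.3f}")
```

What counts as "close" is a judgment call; a threshold on the largest gap is one simple option, and a chi-squared goodness-of-fit test would be a more formal one.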

mikemahoney218 commented 5 years ago

A task that depends on this one: if the rates do not match, can we downsample so that the detailed dataset is representative?
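
One possible approach, sketched under assumptions: treat each offense category as a stratum and randomly drop records until the offense mix matches a set of target shares (for example, shares derived from the FBI summary). The record layout, the `offense` field, the target shares, and the function name below are hypothetical; this is only one way to downsample, not a settled method.

```python
import random


def downsample_to_shares(records, target_shares, key, seed=0):
    """Randomly downsample records so group shares approximate target_shares."""
    rng = random.Random(seed)  # fixed seed keeps the sample reproducible
    by_group = {}
    for rec in records:
        by_group.setdefault(key(rec), []).append(rec)

    # Largest total sample size for which every group can still supply
    # its target share without running out of records.
    total = min(
        len(by_group.get(group, [])) / share
        for group, share in target_shares.items()
        if share > 0
    )

    sample = []
    for group, share in target_shares.items():
        available = by_group.get(group, [])
        k = min(round(total * share), len(available))
        sample.extend(rng.sample(available, k))
    return sample


# Made-up records and target shares, for illustration only.
records = (
    [{"offense": "assault"}] * 50
    + [{"offense": "burglary"}] * 30
    + [{"offense": "larceny"}] * 20
)
target = {"assault": 0.45, "burglary": 0.35, "larceny": 0.20}

sample = downsample_to_shares(records, target, key=lambda r: r["offense"])
sample_counts = {}
for rec in sample:
    sample_counts[rec["offense"]] = sample_counts.get(rec["offense"], 0) + 1
print(sample_counts)
```

Because counts are rounded to whole records, the resulting shares only approximate the targets, and the most under-represented category limits how large the sample can be.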

jeremylang commented 4 years ago

No longer need a proxy state now that MA data is available.