codeforboston / clean-slate-data

How can we determine which individuals in the PA dataset have multiple offenses? #12

Closed: mikemahoney218 closed this issue 4 years ago

mikemahoney218 commented 5 years ago

We only have a random sample of PA crime data (currently ~10% of 2018, though issue #10 hopes to widen that). Because the sample is partial, it is difficult to track which offenders are repeat offenders, and how often they appear, over the course of the year. While #10 is ongoing and aims to cast a much broader net, it would be good to spend time thinking about how we can identify repeat offenders if we can't get a more complete dataset.
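As a starting point for discussion, here is a minimal sketch of one way to flag people who appear on more than one docket. It assumes each record carries a name, a date of birth, and a docket number. The field names (`docket`, `name`, `dob`) and the sample rows are hypothetical and not taken from the actual PA data, and matching on normalized name plus date of birth is only a crude stand-in for real record linkage.

```python
from collections import defaultdict

# Hypothetical records: field names and values are made up for illustration
# and are not taken from the actual PA dataset.
records = [
    {"docket": "D-001", "name": "Jane Doe ", "dob": "1980-01-02"},
    {"docket": "D-002", "name": "jane doe", "dob": "1980-01-02"},
    {"docket": "D-003", "name": "John Roe", "dob": "1975-06-30"},
    {"docket": "D-003", "name": "John Roe", "dob": "1975-06-30"},  # same docket listed twice
]


def person_key(record):
    """Build a crude matching key from a normalized name and date of birth."""
    name = " ".join(record["name"].lower().split())
    return (name, record["dob"])


def dockets_by_person(rows):
    """Map each person key to the set of distinct dockets seen for it."""
    grouped = defaultdict(set)
    for row in rows:
        grouped[person_key(row)].add(row["docket"])
    return dict(grouped)


def people_with_multiple_dockets(rows):
    """Return only the keys that appear on more than one distinct docket."""
    return {key: dockets for key, dockets in dockets_by_person(rows).items()
            if len(dockets) > 1}


multiple = people_with_multiple_dockets(records)
```

Counting distinct dockets rather than rows keeps a docket that is listed twice from looking like two separate cases. With only a ~10% sample, any count produced this way is a lower bound, since a person's other records may simply not have been sampled.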

knod commented 5 years ago

Would getting the summary sheets be helpful?

dawngraham commented 4 years ago

As a general note, we should try to be thoughtful about the language we use, for example "people who have been convicted" or "people with records in the criminal justice system" instead of "offenders".

jeremylang commented 4 years ago

No longer need a proxy state now that MA data is available.