invinst / chicago-police-data

a collection of public data re: CPD officers involved in police encounters
https://invisible.institute/police-data

Comparing Complaints data sets #58

Open DGalt opened 8 years ago

DGalt commented 8 years ago

We have 3 data sets that are marked as containing complaints data:

Some questions posed by @rajivsinclair for us to look at:

I will leave this issue open as a place to collect notes and analysis related to these questions.

DGalt commented 8 years ago

Here's a very quick first pass at gauging how much of the data in these sets overlaps, based on complaint numbers (CRIDs).

Total number of unique CRIDs:

Number of CRIDs that do not overlap:

For the June - CPDB comparison, 2170 of the 2788 non-overlapping CRIDs can be accounted for by non-overlapping time periods: CPDB (based on incident_date) ranges from 2011-03-13 to 2015-08-19, while the June set (based on Incident_Datetime) ranges from 2012-04-22 to 2016-04-22.

For the April - CPDB comparison, 782 of the 3349 non-overlapping CRIDs can be accounted for by non-overlapping time periods: April (based on normalized Incident_Start_Time) ranges from 1978-09-10 to 2016-04-12.
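For anyone wanting to reproduce this kind of count, here is a minimal sketch of the idea, not the code actually used for the numbers above. It assumes each data set has been reduced to a mapping from CRID to a single, non-missing incident date. The helper names (`non_overlapping`, `split_by_shared_window`) and the sample records are hypothetical.

```python
from datetime import date


def non_overlapping(a, b):
    """Return the CRIDs that appear in exactly one of the two data sets."""
    return set(a) ^ set(b)


def split_by_shared_window(a, b):
    """Split unmatched CRIDs into (explained, unexplained).

    a, b: dicts mapping CRID -> incident date (assumed non-missing).
    A CRID is "explained" when its incident date falls outside the
    date window that both data sets cover.
    """
    # Shared window = intersection of the two observed date ranges.
    start = max(min(a.values()), min(b.values()))
    end = min(max(a.values()), max(b.values()))

    # Every unmatched CRID lives in exactly one set, so merging is safe here.
    dates = {**a, **b}
    unmatched = non_overlapping(a, b)
    explained = {c for c in unmatched if not (start <= dates[c] <= end)}
    return explained, unmatched - explained


# Made-up records for illustration only (not real CRIDs).
cpdb = {
    "1001": date(2011, 3, 13),
    "1002": date(2013, 1, 5),
    "1003": date(2014, 6, 1),
    "1004": date(2015, 8, 19),
}
june = {
    "1002": date(2013, 1, 5),
    "1005": date(2012, 4, 22),
    "1006": date(2014, 2, 2),
    "1007": date(2016, 4, 22),
}

explained, unexplained = split_by_shared_window(cpdb, june)
print(len(non_overlapping(cpdb, june)), sorted(explained), sorted(unexplained))
```

The CRIDs left in `unexplained` are the ones worth a closer look, since they fall inside the period both sets cover but show up in only one of them.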

jayqi commented 7 years ago

Notebook from the mob programming session at CHN on Sept 6. We looked at the CRIDs that overlap between the June and CPDB sets and how their category codes changed.

https://github.com/jayqi/mob_programming_2016-09-06/blob/master/Mob%20Programming%20-%20Sept%206%2C%202016.ipynb
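A comparison of category codes for overlapping CRIDs could be sketched roughly as below. This is an illustration under assumptions, not a summary of what the notebook does: it assumes one category code per CRID in each set, and the function name `category_changes` and the sample codes are made up.

```python
def category_changes(a, b):
    """Return {CRID: (code_in_a, code_in_b)} for CRIDs in both sets whose codes differ.

    a, b: dicts mapping CRID -> category code (assumed one code per CRID).
    """
    shared = a.keys() & b.keys()
    return {crid: (a[crid], b[crid]) for crid in shared if a[crid] != b[crid]}


# Made-up codes for illustration only.
june_codes = {"1002": "05A", "1005": "10J", "1006": "03C"}
cpdb_codes = {"1002": "05B", "1006": "03C", "1003": "04A"}

changed = category_changes(june_codes, cpdb_codes)
print(changed)
```

If a CRID carries several rows (for example one per accused officer), the rows would need to be grouped or compared pairwise first.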