Open DGalt opened 8 years ago
Here's a very quick first pass at getting a handle on how much of the data here overlaps, based on complaint numbers (CRIDs).
Total number of unique CRIDs:
Number of CRIDs that do not overlap:
For the June - CPDB comparison, of the 2788 non-overlapping CRIDs, 2170 of them can be accounted for by non-overlapping time periods - i.e. CPDB, based on `incident_date`, ranges from 2011-03-13 to 2015-08-19 while the June set (also based on `Incident_Datetime`) ranges from 2012-04-22 to 2016-04-22.
For the April - CPDB comparison, of the 3349 non-overlapping CRIDs, 782 of them can be accounted for by non-overlapping periods - April (based on normalized `Incident_Start_Time`) ranges from 1978-09-10 to 2016-04-12.
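For anyone who wants to reproduce this kind of count, here is a rough sketch of the logic using only the Python standard library. It is not the notebook code. It assumes each data set has already been reduced to a `{crid: incident_date}` mapping, and it treats a CRID found in only one set as "accounted for" when its incident date falls outside the date range covered by the other set. The function names and the toy CRIDs are made up for illustration.

```python
from datetime import date


def date_range(records):
    """Return (earliest, latest) incident date in a {crid: date} mapping."""
    dates = list(records.values())
    return min(dates), max(dates)


def overlap_summary(a, b):
    """Compare two {crid: incident_date} mappings (hypothetical helper)."""
    only_a = a.keys() - b.keys()  # CRIDs present only in the first set
    only_b = b.keys() - a.keys()  # CRIDs present only in the second set
    a_lo, a_hi = date_range(a)
    b_lo, b_hi = date_range(b)
    # Assumption: a non-overlapping CRID is "accounted for" when its incident
    # date lies outside the period covered by the other data set.
    explained = {c for c in only_a if not (b_lo <= a[c] <= b_hi)}
    explained |= {c for c in only_b if not (a_lo <= b[c] <= a_hi)}
    return {
        "unique": len(a.keys() | b.keys()),
        "non_overlapping": len(only_a) + len(only_b),
        "explained_by_dates": len(explained),
    }


# Toy data only; these CRIDs are invented, not taken from the real sets.
cpdb = {
    "1001": date(2011, 3, 13),
    "1002": date(2013, 5, 1),
    "1003": date(2015, 8, 19),
}
june = {
    "1002": date(2013, 5, 1),
    "1004": date(2012, 4, 22),
    "1005": date(2016, 4, 22),
}
summary = overlap_summary(cpdb, june)
print(summary)
```

The CRIDs left over after subtracting `explained_by_dates` from `non_overlapping` are the ones worth a closer look, since the date ranges alone do not explain why they are missing from one set.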
Notebook from mob programming at CHN on Sept 6. We looked at June-CPDB overlapping CRIDs and how category codes changed.
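A minimal sketch of the category-code comparison, in case it helps others follow along without the notebook. It assumes each set has been reduced to a `{crid: category_code}` mapping; the function name and the codes in the toy data are placeholders, not values from the real data.

```python
from collections import Counter


def category_changes(first, second):
    """Count (old_code, new_code) pairs for CRIDs present in both mappings
    whose category code differs between the two sets (hypothetical helper)."""
    shared = first.keys() & second.keys()
    return Counter(
        (first[crid], second[crid])
        for crid in shared
        if first[crid] != second[crid]
    )


# Toy data only; CRIDs and category codes are invented for illustration.
cpdb_codes = {"1001": "A1", "1002": "B2", "1003": "C3"}
june_codes = {"1001": "A1", "1002": "B9", "1003": "C7", "1004": "D4"}

changes = category_changes(cpdb_codes, june_codes)
for (old, new), n in sorted(changes.items()):
    print(f"{old} -> {new}: {n}")
```

CRIDs that appear in only one set (such as `1004` above) are ignored here, since there is nothing to compare them against.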
We have 3 data sets that are marked as containing complaints data:
Some questions posed by @rajivsinclair for us to look at:
`has some officer identified`, `has some investigator identified`, `has some final investigation/disciplinary outcomes`, `has conflicting initial vs final outcomes`

I will be leaving this issue open as a place to collect notes / analysis related to these questions.
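As a starting point for the questions above, here is a hedged sketch of how the four criteria could be tallied once complaints are in a common shape. The record layout (`officers`, `investigators`, `initial_outcome`, `final_outcome`) is purely an assumption for illustration and does not match any of the actual column names. "Conflicting" is taken here to mean that both outcomes are present and differ.

```python
from collections import Counter


def complaint_flags(record):
    """Return the four yes/no criteria for one complaint record.

    The field names are hypothetical; map them to the real columns first.
    """
    initial = record.get("initial_outcome")
    final = record.get("final_outcome")
    return {
        "has_officer": bool(record.get("officers")),
        "has_investigator": bool(record.get("investigators")),
        "has_final_outcome": final is not None,
        # Assumption: a conflict means both outcomes exist and are different.
        "has_conflicting_outcomes": (
            initial is not None and final is not None and initial != final
        ),
    }


def count_flags(records):
    """Count how many records satisfy each criterion."""
    totals = Counter()
    for record in records:
        totals.update(k for k, v in complaint_flags(record).items() if v)
    return totals


# Toy records only; values are invented for illustration.
records = [
    {"officers": ["X"], "investigators": [], "initial_outcome": "A", "final_outcome": "B"},
    {"officers": [], "investigators": ["Y"], "initial_outcome": "A", "final_outcome": "A"},
    {"officers": ["Z"], "investigators": ["Y"], "initial_outcome": None, "final_outcome": None},
]
totals = count_flags(records)
print(dict(totals))
```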