invinst / chicago-police-data

a collection of public data re: CPD officers involved in police encounters
https://invisible.institute/police-data

Double-check data in /Clean/May2016 folder #19

Closed jilmun closed 6 years ago

jilmun commented 8 years ago

Data was created in pull request #17

alexsoble commented 8 years ago

@rajivsinclair and @ithinkidunno – do you have your own attempts at merged / clean data that we can check against @yahwes's?

DGalt commented 8 years ago

(posted in slack, but figure it should go here too) So I started looking at this as well - @yahwes what did you do when there was no obvious match? E.g. when there is a nan in the Complaint_Number column, or when a Complaint_Number appears in one of the sheets in the pair but not the other (e.g. 2012 Parties has the complaint number 10521242, but this does not exist in the 2012 Incid sheet).

alexsoble commented 8 years ago

@dgalt yeah I think doing as much of the work as possible in public is 👍🏽👍🏽👍🏽

jilmun commented 8 years ago

(posted in slack, but figure it should go here too) The 2012 CRID 1052142 is the only one that was in "parties" but not in "incid". In that case, the values for the 13 columns that come only from "incid" (Beat:Incident_Time_End, CLOSEDATIPRA_DATETIME, Report_Status:Penalty_Status) are left blank.
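For anyone re-checking this, the behaviour described above (keep every "parties" row, leave the "incid"-only columns blank when there is no match) corresponds to a left join. Below is a minimal pandas sketch of that idea, not the code from pull request #17. The toy rows, the `Party` column, and the complaint numbers are made up for illustration; only `Complaint_Number` and `Beat` are names taken from the sheets.

```python
import pandas as pd

# Toy stand-ins for one year's "parties" and "incid" sheets.
# All values here are invented; the None mimics a missing Complaint_Number.
parties = pd.DataFrame({
    "Complaint_Number": ["1001", "1002", "1003", None],
    "Party": ["A", "B", "C", "D"],  # hypothetical column
})
incid = pd.DataFrame({
    "Complaint_Number": ["1001", "1002", "1004"],
    "Beat": ["0111", "0222", "0444"],
})

# Left join: every "parties" row is kept. Columns that exist only in
# "incid" come back as NaN wherever no matching Complaint_Number exists.
merged = parties.merge(incid, on="Complaint_Number", how="left", indicator=True)

# Rows that found no partner in "incid", including the row with a missing key.
unmatched = merged[merged["_merge"] == "left_only"]
print(unmatched[["Complaint_Number", "Party", "Beat"]])
```

The `indicator=True` column makes it easy to list the unmatched complaint numbers and compare that list between two people's merges.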

DGalt commented 8 years ago

Going through @yahwes's process I get the same thing

alexsoble commented 8 years ago

👍🏽👍🏽

Anything else we should document here?

@yahwes what is a .feather file?

jilmun commented 8 years ago

It's the new Python/R dataframe format.

alexsoble commented 8 years ago

Gotcha, thanks! @yahwes