Data4Democracy / incarceration-trends

Analysis of incarceration data to inform bail reform legislation in Colorado. Data for Democracy x ACLU of Colorado.

data integrity of pretrial snapshot #3

Open ghonk opened 5 years ago

ghonk commented 5 years ago

The data here don't appear to be consistent re: use of missing data labels and have a number of other issues that warrant discussion.

For example, on the RacialEthnic Data tab, missing values appear as N/A, blanks, replied "unknown", and unknown. Comments in the data also indicate that multi-racial individuals are assigned one of their reported races.
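One possible way to make the missing-value labels consistent is to map every variant to a single sentinel before analysis. This is only a sketch: the `MISSING_LABELS` set and the `normalize_missing` helper are hypothetical names, and treating replied "unknown" as missing is an assumption that should be confirmed with the project partner, since it may be a distinct category.

```python
import math
from typing import Any, Optional

# Assumed missing-value labels (lower-cased), based on the variants listed
# above; this is not a confirmed data dictionary.
MISSING_LABELS = {"", "n/a", "unknown", 'replied "unknown"'}


def normalize_missing(value: Any) -> Optional[Any]:
    """Return None for cells that look like a missing-value label."""
    if value is None:
        return None
    # Spreadsheet readers often surface blank numeric cells as NaN.
    if isinstance(value, float) and math.isnan(value):
        return None
    if isinstance(value, str) and value.strip().lower() in MISSING_LABELS:
        return None
    return value


# Illustrative cells only, not values from the actual spreadsheet.
cells = ["N/A", "", 'Replied "unknown"', "unknown", "12", 7]
cleaned = [normalize_missing(c) for c in cells]
```

If replied "unknown" turns out to mean something different from a blank, dropping it from `MISSING_LABELS` would keep that distinction in the cleaned data.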

The Hold Breakdowns tab has a similar problem, and it also contains non-integer values even though the figures are counts of inmates.

It would be good to document these issues and try to get clarity from the project partner (ACLU?).

charlottemcclintock commented 5 years ago

Definitely something we should run by the ACLU contact! I'll make a doc in the Google Drive with questions.

akelleh commented 5 years ago

The Racial/Ethnic tab also has non-integer data for Pueblo! Looks like Pueblo is the only one with non-integer values, across all tabs.
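A check like this could be scripted so it can be rerun on each tab. The sketch below assumes each tab has already been loaded as a list of dicts with a `county` key; the `non_integer_counts` helper, the column names, and the sample rows are all made up for illustration and are not taken from the spreadsheet.

```python
import math
from collections import defaultdict
from typing import Any, Dict, List, Tuple


def non_integer_counts(
    rows: List[Dict[str, Any]], county_key: str = "county"
) -> Dict[str, List[Tuple[str, float]]]:
    """Map each county to the (column, value) pairs that are not whole numbers."""
    flagged = defaultdict(list)
    for row in rows:
        for column, value in row.items():
            if column == county_key:
                continue
            # Only floats can carry a fractional part; skip NaN (missing).
            if isinstance(value, float) and not math.isnan(value):
                if not value.is_integer():
                    flagged[row[county_key]].append((column, value))
    return dict(flagged)


# Hypothetical rows; the real tab layout and numbers will differ.
rows = [
    {"county": "Pueblo", "group_a": 101.5, "group_b": 20.0},
    {"county": "Denver", "group_a": 300, "group_b": 150},
]
flagged = non_integer_counts(rows)
```

Running the same function over every tab would show whether the fractional values are confined to one county or appear elsewhere.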

ghonk commented 5 years ago

2/3 of the meaningful datasets in this xlsx doc have been converted to a usable format (needs review).