Open seanjtaylor opened 7 years ago
Another update from @scottcame:
Well the FBI actually has several different classification schemes. There is UCR (been in use with some mods since the 1930s), NIBRS (extension of UCR), NCIC... And then some agencies define their own. Nearly all agencies report UCR (summary) data to the FBI via their state UCR program. Only about 35% of agencies submit NIBRS (incident level data). Some of this reporting involves manual categorization of offenses and arrests Most state UCR programs send data monthly or so, but it varies and some are very infrequent. For larger jurisdictions it should be possible to map to the nibrs codes, which is probably the right level for what you're doing Much of it is CAD data that does not contain incident or offense/arrest info
People being referenced as experts on this:
Some helpful info from https://ucr.fbi.gov/crime-in-the-u.s/2016/preliminary-semiannual-uniform-crime-report-januaryjune-2016 (regarding the rape category and a caution against ranking)
PLEASE NOTE In 2013, the FBI’s UCR Program initiated the collection of rape data under a revised definition within the Summary Based Reporting System. The term “forcible” was removed from the offense name, and the definition was changed to “penetration, no matter how slight, of the vagina or anus with any body part or object, or oral penetration by a sex organ of another person, without the consent of the victim.”
The number of rape incidents reported using the revised definition, as well as the number of rapes submitted using the legacy definition, are included in this report in separate columns in each table. The rape figures for those agencies that changed from reporting rape under the legacy definition in 2015 to the revised definition in 2016 are not included in trend calculations in Tables 1-3, but they are reported in Table 4 for agencies 100,000 and more in population. Please note: Rape data reported for 2015 and 2016 cannot be aggregated by all agencies. Instead, two distinct groups of agencies (those reporting using the legacy definition and those reporting using the revised definition) are used for calculating trends. Therefore, the percent changes from one year to the next within each group are calculated with fewer agencies than in recent years. Offenses with fewer counts are often sensitive to minor differences when calculating trends. More information about this subject is presented in footnotes and data declarations for each table.
Caution against ranking Figures used in this Report were submitted voluntarily by law enforcement agencies throughout the country. Individuals using these tabulations are cautioned against drawing conclusions by making direct comparisons between cities. Comparisons lead to simplistic and/or incomplete analyses that often create misleading perceptions adversely affecting communities and their residents. Valid assessments are possible only with careful study and analysis of the range of unique conditions affecting each local law enforcement jurisdiction. It is important to remember that crime is a social problem and, therefore, a concern of the entire community. In addition, the efforts of law enforcement are limited to factors within its control. The data user is, therefore, cautioned against comparing statistical data of individual agencies. Further information on this topic can be obtained in Uniform Crime Reporting Statistics: Their Proper Use.
Haven't been keeping up with the Slack, but is there a plan for this issue? This is definitely a difficult problem to solve. I am wondering if a reasonable approach would be:
Would this be a reasonable game plan, or do you guys think something else would be needed? If this is reasonable, I'm happy to work on it.
Hey guys, just wanted to stop by and see what work is already being done on this task and see how I could help. I'm adantonison on slack.
Update from @mkedataguy: