chicago-police-violence / data

Dataset about the personnel, use of force, and complaints in the Chicago Police Department
MIT License

award+salary in clean_profiles? #29

Closed trevorcampbell closed 3 years ago

trevorcampbell commented 3 years ago

@Thibauth I'm not sure what clean_profiles.py does -- should it be run after merging both salary and awards? And should the key function in clean_profiles.py have entries for salary and awards?

Thibauth commented 3 years ago

This script needs to be cleaned up a bit, but it is the one that generates (or is supposed to generate) roster.csv from profiles.csv by selecting a single value for each attribute to come up with the "ultimate" profile for each officer (this is explained in the documentation). So it should be run last, once we have collected profiles from all the datasets.

The key function basically defines the precedence order between datasets in case of a conflict, so yes, in principle you should decide where salary and awards stand in the precedence order.
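
For illustration only, here is a minimal sketch of how a precedence-based key function could resolve conflicts between datasets. It is not the actual code in clean_profiles.py: the names `SOURCE_PRECEDENCE`, `source_key`, and `resolve_profile`, the dataset labels, their ranking, and the record layout are all assumptions made for this example.

```python
# Hypothetical precedence order: earlier entries win when datasets disagree.
# The dataset names and their ranking are illustrative assumptions.
SOURCE_PRECEDENCE = ["roster", "complaints", "salary", "awards"]


def source_key(record):
    """Sort key: a lower value means higher precedence; unknown sources go last."""
    source = record["source"]
    if source in SOURCE_PRECEDENCE:
        return SOURCE_PRECEDENCE.index(source)
    return len(SOURCE_PRECEDENCE)


def resolve_profile(records, attributes):
    """Pick one value per attribute from the highest-precedence record that has it."""
    ordered = sorted(records, key=source_key)
    resolved = {}
    for attr in attributes:
        for record in ordered:
            value = record.get(attr)
            if value not in (None, ""):
                resolved[attr] = value
                break
        else:
            # No dataset provided a usable value for this attribute.
            resolved[attr] = None
    return resolved


# Three conflicting profiles for the same (made-up) officer.
profiles = [
    {"source": "awards", "last_name": "DOE", "rank": "SERGEANT"},
    {"source": "salary", "last_name": "DOE", "rank": "POLICE OFFICER"},
    {"source": "roster", "last_name": "Doe", "rank": ""},
]
ultimate = resolve_profile(profiles, ["last_name", "rank"])
```

In this sketch, deciding where salary and awards stand amounts to choosing their positions in `SOURCE_PRECEDENCE`. An empty value in a higher-precedence dataset falls through to the next dataset instead of overwriting a real value.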

trevorcampbell commented 3 years ago

Thanks for the info! Added precedence order for salary and awards now 780f2f0