Closed trevorcampbell closed 3 years ago
This script needs to be cleaned up a bit, but it is the one which generates (or is supposed to) roster.csv
from profiles.csv
by selecting a single value for each attributes to come up with the "ultimate" profile for each officer (this is explained in the documentation). So it should be ran last, once we have collected profiles from all the datasets.
The key function basically defines the precedence order between datasets in case of a conflict, so yes, in principle you should decide where salary and awards stand of the precedence order.
Thanks for the info! Added precedence order for salary and awards now 780f2f0
@Thibauth I'm not sure what
clean_profiles.py
does -- should that be run after both merging salary and awards? And should there be entries in thekey
function for salary+awards inclean_profiles.py
?