Open zacka-cartercenter opened 1 year ago
I edited the Excel file in the Teams chat.
@esinclairTCC - I harmonized basically all of the `adm2_name`s in the pre201905 data set in the `main` branch.
FYI - the harmonization is done here:
Just 2 small issues that maybe you can provide clarity on:
The only remaining "unharmonized" adm2_names in the pre 201905 dataset are "gambella" and "refugges_gambella"
@sinclairelaina
I've been modifying the cleaning and compiling functions so that they can clean the old pre-May 2019 data format. While working on these generic functions (in the `R` directory), I've continued testing them in `data_cleaning.Rmd`.
In the section below, the old-format data goes through a preliminary cleaning where the names are cleaned based on the previous framework (i.e., `case_when`s), but obviously there are some new issues (especially since we don't have adm1 in the old format). On line 389 below, the "cleaned" data is compared to the admin master list and the non-matching values are returned:
https://github.com/Carter-Center-Health-Data-Support-Unit/CC-RB-LF-SCH-DASHBOARD/blob/f3490de35624f2e68f7be7e7446fc886f40db53c/documentation/data_cleaning.Rmd#L346-L399
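For anyone following along, the comparison step can be sketched in base R roughly as below. This is only an illustration, not the code at the linked lines: the data frames `cleaned` and `admin_master`, the placeholder district names, and the helper `find_unmatched_adm2()` are all hypothetical.

```r
# Hypothetical stand-in for the admin master list (placeholder values).
admin_master <- data.frame(
  adm2_name = c("district_a", "district_b", "district_c"),
  stringsAsFactors = FALSE
)

# Hypothetical stand-in for the "cleaned" old-format data (placeholder values).
cleaned <- data.frame(
  adm2_name = c("district_a", "district_b", "distrct_c", "district_x", "distrct_c"),
  stringsAsFactors = FALSE
)

# Return the unique adm2_name values in `data` that are absent from `master`.
find_unmatched_adm2 <- function(data, master) {
  sort(setdiff(unique(data$adm2_name), master$adm2_name))
}

unmatched <- find_unmatched_adm2(cleaned, admin_master)
print(unmatched)
```

Anything this returns is a candidate for a new harmonization rule.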
Therefore, the `clean_adm2` function's `case_when` statement below needs to be augmented:
https://github.com/Carter-Center-Health-Data-Support-Unit/CC-RB-LF-SCH-DASHBOARD/blob/f3490de35624f2e68f7be7e7446fc886f40db53c/R/reclassify_admins.R#L137-L166
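As a rough sketch of what "augmenting" could look like, assuming dplyr: the function name `clean_adm2_sketch()` and every mapping below are hypothetical placeholders, not the actual `clean_adm2` code or real district names.

```r
library(dplyr)

# Hypothetical sketch; NOT the real clean_adm2 from R/reclassify_admins.R.
clean_adm2_sketch <- function(df) {
  df %>%
    mutate(
      adm2_name = case_when(
        # Existing-style rule (placeholder values).
        adm2_name == "distrct_c" ~ "district_c",
        # Newly added rule for variants seen only in the old format (placeholders).
        adm2_name %in% c("district_x", "dist_x") ~ "district_x_harmonized",
        # Fall through: keep any value that no rule matches.
        TRUE ~ adm2_name
      )
    )
}

# Placeholder old-format data to exercise the sketch.
old_format <- data.frame(
  adm2_name = c("distrct_c", "dist_x", "district_a"),
  stringsAsFactors = FALSE
)

result <- clean_adm2_sketch(old_format)
print(result$adm2_name)
```

The final `TRUE ~ adm2_name` branch matters: without it, any name not covered by a rule would become `NA` instead of passing through unchanged.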
Can you: