Closed tm-jc-nacpil closed 1 year ago
Sounds good! Just one thing, there seems to be a discrepancy with the index re-calculation.
def recalculate_index(country_list):
"Recalculates wealth index based on specified country"
return df
vs
# Run PCA and recalculate index accordingly
index = PCA(country_household_data)
I suggest the former (creating a recalculate index function with a list of countries as param) though for convenience, as I think it's likely to be re-used.
Hi @alronlam ! I'd like to sanity check an approach for returning the cross-country datasets. :D
generate_cluster/household_level_data
, it stores the output in a class, index by country namedhs_data = DHSDataManager.generate_data()
pattern, while also keeping track of all the countries we've performed it onPseudocode
Ideal usage