worldbank / ESG_gaps_research

See draft publication here: https://worldbank.github.io/ESG_gaps_research/

Changes to explanation framework #25

Open tgherzog opened 4 years ago

tgherzog commented 4 years ago
| Explanation | Indicator count |
|-------------|-----------------|
| no_gap      | 12              |
| expl_a      | 15              |
| expl_b      | 15              |
| expl_c      | 66              |
| expl_d      | 2               |
| expl_e      | 8               |
| expl_f      | 32              |
| expl_g      | 12              |

tonyfujs commented 4 years ago

@tgherzog Thanks, I'll take a look at this. It's been a while since I last looked at this repo. Can you refresh my memory and tell me which file you are using to produce this table? Thanks!

tgherzog commented 4 years ago

> Can you refresh my memory and tell me which file you are using to produce this table? Thanks!

@Tony - the table is just a simple sum of the data frame returned by load_metadata, which I call like this (same as in load_data.R):

meta = load_metadata('data/esg_metadata.csv', 'data/ESG_wdi.feather')
meta.sum()
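
As a rough illustration of why a plain sum yields the table above, here is a minimal sketch in plain Python. It assumes (this is not confirmed in the thread) that the frame returned by load_metadata has one row per indicator and one 0/1 flag column per explanation. The `rows` data and the `column_sums` helper are hypothetical stand-ins, not code from the repo.

```python
# Hypothetical stand-in for the frame returned by load_metadata:
# one row per indicator, one 0/1 flag per explanation column (an assumption).
rows = [
    {"no_gap": 1, "expl_a": 0, "expl_c": 0},
    {"no_gap": 0, "expl_a": 1, "expl_c": 1},
    {"no_gap": 0, "expl_a": 0, "expl_c": 1},
]


def column_sums(rows):
    """Add up each flag column across all rows (a column-wise sum)."""
    totals = {}
    for row in rows:
        for name, flag in row.items():
            totals[name] = totals.get(name, 0) + flag
    return totals


counts = column_sums(rows)
for name, count in counts.items():
    print(name, count)
```

Under that assumption an indicator can carry more than one flag, so the per-explanation counts need not add up to the number of indicators.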
tonyfujs commented 4 years ago

@tgherzog This has been fixed locally. The only pending issue is that indicator "IC.BUS.EASE.XQ" is still missing from the Explanation F chart. This is because it only has data for 2019, and the availability charts only cover years up to 2018 (see the esg_availability.R script).

Is there a specific reason why 2019 should not be included in the analysis? cc @randrescastaneda

tgherzog commented 4 years ago

This is a good point. I think the answer is that we started this back in mid-2019, when there was very little 2019 data and 2018 was considered the "latest available year." We haven't updated the database since, except for technical fixes. The paper makes this point in the background section as well as in the "no gaps" discussion.

@randrescastaneda had a related issue for the heatmap discussion. I had suggested we expand it to 2019 but now I'm rethinking it.

The easiest solution to me is to consistently limit the analysis to 2000-2018. I can fix the metadata loader code accordingly. This would impact the following indicators:

The other option is to include 2019 consistently throughout, but I don't think that's really fair considering when the database was built.
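
One way to see which indicators a fixed 2000-2018 window would affect is sketched below in plain Python. This is only an illustration under stated assumptions: the `years_with_data` mapping, the `EXAMPLE.*` codes, and the helper names are hypothetical and do not come from the repo's loader. Only the IC.BUS.EASE.XQ entry (data for 2019 only) reflects what was reported in this thread.

```python
FIRST_YEAR, LAST_YEAR = 2000, 2018  # window proposed in the thread

# Hypothetical input: indicator code -> years that have data.
# Only the IC.BUS.EASE.XQ entry reflects the thread; the others are made up.
years_with_data = {
    "IC.BUS.EASE.XQ": [2019],
    "EXAMPLE.A": [2005, 2010, 2018, 2019],
    "EXAMPLE.B": [2001, 2002, 2003],
}


def clip_to_window(years, first=FIRST_YEAR, last=LAST_YEAR):
    """Keep only the years that fall inside the inclusive window."""
    return [year for year in years if first <= year <= last]


def impacted(years_with_data, first=FIRST_YEAR, last=LAST_YEAR):
    """Return (dropped, trimmed): codes that lose all data, and codes that lose some."""
    dropped, trimmed = [], []
    for code, years in years_with_data.items():
        kept = clip_to_window(years, first, last)
        if not kept:
            dropped.append(code)
        elif len(kept) < len(years):
            trimmed.append(code)
    return dropped, trimmed


dropped, trimmed = impacted(years_with_data)
print("dropped entirely:", dropped)
print("partially trimmed:", trimmed)
```

Keeping the window in a single pair of constants means the metadata loader and the availability charts could share the same bounds, whichever end year is chosen.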