ONEcampaign / toughest_places_index

New analysis from the ONE Campaign presents the toughest places to feed a family
MIT License
1 stars 0 forks source link

Discussion: weighting options #43

Closed jm-rivera closed 2 years ago

jm-rivera commented 2 years ago

Starting a discussion on this @amy-dodd @lpicci96 @Mattie-P

For simplicity, equal weights can be favoured.

Equal weights (or similarly ‘no’ weights) All indicators get assigned the same weight. In other words, the final index could simply be the non-weighted arithmetic average of the normalised indicators.

Pros:

Cons:

Some alternatives.

All come with complexity cost (not necessarily for building but for explaining)

Unequal weights Indicators are assigned different values for different reasons. These could include coverage (so indicators with full data are assigned a heavier weight than indicators with lots of imputed values), trustworthiness of the source, or expert judgement.

Pros:

Cons:

Budget Allocation Process Like the above, but we go to a panel of experts to get their take regarding appropriate weights. The basic idea is that each expert gets a series of points to distribute to the indicators and then an average of the experts’ choices is used to create the weights. Another similar (but slightly more complex) is the Analytic Hierarchy Process.

Pros:

Cons

Data-driven options

Correlation Analysis Correlation analysis can be used to obtain weights in two ways: 1) simple correlation matrix, with the indicator weights being proportional to the sum of the absolute values of that row or column respectively. 2)identifying an indicator to serve as the endogenous criterion to which we compare all other indicators in order to derive their weights.

Pros

Cons

Multiple Linear Regression Analysis could be consider to take this logic one step further and explore causal links, but the assumption that there is strict linearity on these types of issues is very debatable.

Principal Component Analysis (PCA) The idea here is basically a reduction of variables (/indicators). You try to capture the highest possible variance with as few indicators as possible. This is done through a series of linear transformations of the standardised indicators. It can be used in two ways: 1) to set weights or 2) to create new indicators (principal components) that describe your data. Number 1 is easier to communicate and more useful for our purpose so I’ll focus on that one here.

Pros

Cons

Both correlation Analysis and PCA produce weights that are derived from the data. So if the data changes (like if we keep updating this regularly/as new data comes) it can make direct comparisons difficult.

Benefit of the Doubt (a version of data envelopment analysis) The idea here is that most weighting systems will favour certain countries and penalise others. Here, the weights are constructed for each country in such a way as to maximise its performance.

Pros

Cons

amy-dodd commented 2 years ago

Thanks for this Jorge - be good to discuss but I'm inclined to keep it as simple as possible for all the reasons we've discussed. At first glance my preferences are probably:

Budget allocation process (needs way too much time), correlation and the benefit of the doubt approach don't seem right for our purposes or timeline here.