Leeds-MRG / Minos

SIPHER Microsimulation for estimating the effect on Income policy on mental health.
MIT License
4 stars 3 forks source link

Validation #35

Open ld-archer opened 2 years ago

ld-archer commented 2 years ago

Starting this early so we can keep adding to it when we think of things. E.g. add data sources for validation if/when we find them.

ld-archer commented 2 years ago

Household Income

We want to validate household income directly using some external data source, and it might be good to validate the council tax values we generated to use in the calculations. Northern Ireland will be a problem if we validate the council tax values as they calculate their tax differently to the rest of the UK.

The hsctax variable seems to be council tax information including the range for each band. If we see a problem with hh_income then we can look into using this variable and the ranges that go with it.

RobertClay commented 2 years ago
RobertClay commented 2 years ago

plan longitudunal validation.

ld-archer commented 1 year ago

Internal Validation

Cross-validation - split dataset in half, use one side to estimate transition models and the other side as a start point for the simulation. See how well our simulated data tracks with real life.

ROC Curve - The only binary absorbing variable we have is death, but this is a very important component to validate. We can use the pROC package in R.

ld-archer commented 1 year ago

External Validation

Comparison With Actual Data

This would be comparing our nowcasting (simulate from 2009-2020ish) to actual data from any source, but best from official outputs and key statistics. Here would be a good place to list any data we find:

Corroboration

Corroborate against other microsimulations - Scottish government microsims have done some similar interventions using UKMOD that we could compare our own outputs to.

ld-archer commented 1 year ago

PRORITY::

Big facet grid of cross-validation plots. Continuous line plots for hh_income and SF12, and stacked bar plots for ordinal pathway variables.