Open maxlogutility opened 3 years ago
Implemented the above procedure by this commit. It's found that broad prior of random_infection_rate
, exposure_exponent
, and exposure_probability4unit_contact
results in very high variance of total_death
even if the policy parameters are fixed. This high variance leads the situation that policy parameters that lead empirically-minimal total_death
could be merely random outcomes by over-fitting. The problem is expected to vanish once we have reasonable datasets to estimate the three disease parameters above.
Number of deaths and person * n_days in isolation are in a trade-off, where longer isolation reduces total number of death but increase the person days that result in stresses for citizens. So we are interested in the efficient frontier between these two metrics, and what are the optimal policy parameters corresponding to each point on that frontier line.
If we have good estimates of disease parameters, then what we need to do is to search for the best policy parameters under the fixed disease parameters. However, because lack of real data results in high uncertainty (e.g., prior variance as opposed to posterior variance when data are available), we are interested in reasonably good policy parameters that can work under many different possibilities of disease parameters.
For a simple but effective start point, we estimate average-case best policies and worst-case best policies under prior uncertainty, and draw the efficient frontier for each of the policies. To this end, after drawing the samples of both disease and policy parameters by
draw_parameters.py
and fit the meta models oftotal_death
andperson_days_in_isolation
, we should do the following:total_death
be f(Xd, Xp) and that ofperson_days_in_isolation
be g(Xd, Xp), where Xd is the vector of disease parameters and Xp is the vector of policy parameters.mean_f(Xp_i) = (1/n) sum_{t=1}^n f(Xd_t, Xp_i)
andmean_g(Xp_i) = (1/n) sum_{t=1}^n g(Xd_t, Xp_i)
y1_i = mean_f(Xp_i)
vsy2_i = mean_g(Xp_i)
across i=1,2, ...,n
(x, y)
such thaty = min_Y E[Y | X<=x]
(y1_i, y2_i)
that are on that efficient frontier and correspondingXp_i
that are the best policy parametersFor the worst-case evaluation, the above
mean_f
andmean_g
should be replaced by the empirical maximum or 99%-tile across then
samples.This method is brute-force and multi-modality of functions
f
andg
will lead discontinuity of chosen policy parameters between the neighboring points on the frontier. The parameters found, however, are guaranteed to have some robustness across many different environments. If we draw from the posterior distribution of disease parameters instead of the prior, then we can be further confident on the set of workable policies.If you have comments on this approach comments are welcome. @sibylleM @LouiseMatthews