equinor / ert

ERT - Ensemble based Reservoir Tool - is designed for running ensembles of dynamical models such as reservoir models, in order to do sensitivity analysis and data assimilation. ERT supports data assimilation using the Ensemble Smoother (ES), Ensemble Smoother with Multiple Data Assimilation (ES-MDA) and Iterative Ensemble Smoother (IES).
https://ert.readthedocs.io/en/latest/
GNU General Public License v3.0
98 stars 104 forks source link

Add Stratified Sampling to ERT #7854

Open sondreso opened 2 months ago

sondreso commented 2 months ago

Feature Request:

Implementing stratified sampling in ERT to improve the sampling process. Stratified sampling could potentially provide a better coverage of the sample space and reduce the risk of clustering. The suggestion includes the possibility of adding a configuration option for the parameter group to define the sampling strategy, such as RANDOM or STRATIFIED, with the potential to add LATIN_HYPERCUBE in the future.

Suggested Feature

  1. Stratified Sampling Implementation: Introduce stratified sampling as an option for parameter updates within ERT. This would involve setting d=1 and converting to normal distribution, following the method described in this Stack Overflow post.

  2. Configuration Option: Add a new configuration option SAMPLING_STRATEGY on the parameter group level in the ERT configuration files. The user could specify SAMPLING_STRATEGY:STRATIFIED to enable stratified sampling for that parameter group.

  3. Naming Consideration: Instead of naming the random sampling strategy as RANDOM, consider using MONTE_CARLO, if this is appropriate.

Benefits

Considerations

Additional Context


This feature request has been compiled from an internal discussion (link).

Blunde1 commented 2 months ago

This should be under Considerations:

In ERT we understand the update through the statistical properties of the ensemble X (independently sampled). We estimate Cov(X), have monte carlo samples of Y=g(X), estimate Cov(Y) and regress Y on X. The properties are understood. Changing the sampling, one risks loosing the understanding. The statistical properties, at least of these specific estimators, as a function of an LHC sample, must be understood before changing ERT.

sondreso commented 2 months ago

If we use the d=1 option, the variables would still be independently sampled, or no? 🤔

Blunde1 commented 2 months ago

If we use the d=1 option, the variables would still be independently sampled, or no? 🤔

I think across dimensions, yes, but not across samples.