PecanProject / pecan

The Predictive Ecosystem Analyzer (PEcAn) is an integrated ecological bioinformatics toolbox.
www.pecanproject.org
Other
203 stars 235 forks source link

Clarify definition / purpose of ensemble IDs #891

Open rykelly opened 8 years ago

rykelly commented 8 years ago

Ensemble IDs are used to distinguish a group of related model runs. To date they are used to keep track of runs associated with ensemble analysis, sensitivity analysis, and parameter data assimilation (EA, SA, PDA).

There's generally been at most one of each of these analyses per workflow, and so the ens.id has been kept with the EA, SA, and PDA blocks of pecan.xml. But this isn't ideal, because those blocks really keep track of the settings for the analysis, which might be applied to multiple ensembles. In particular, as we implement multi-site workflows, you might want to define the SA (for example) settings once, but apply it to each site separately. After a lot of back and forth, here's some mods to the existing settings structure that Mike and I propose:

<sensitivity.analysis analysis=1>

and a block might have something like

<ensemble.ids>
  <ensemble.id runtype="sensitivity" analysis=1>1000000034</ensemble.id>
  ...

* As you may have noticed, we could use a new name for the <run> block. "run" already refers to a single model run, whereas <run> in pecan.xml stores information that generally applies to many model runs (multiple ensembles, in fact, as we've just discussed). Currently that's site and input settings and start/end dates, but it might include <model> in the future for the purpose of multi-model comparison. Candidates welcome.

github-actions[bot] commented 4 years ago

This issue is stale because it has been open 365 days with no activity.