Closed lee212 closed 3 years ago
notes on the recent changes:
ensemble_size
represents a number of replicas: https://github.com/radical-collaboration/MDFF-EnTK/blob/ff1b0d9ef4c5e9cf11cb91f48d56cae0e36e03b5/cfg/workflow_cfg.yml#L2total_iters
represents number of iterations: https://github.com/radical-collaboration/MDFF-EnTK/blob/ff1b0d9ef4c5e9cf11cb91f48d56cae0e36e03b5/cfg/workflow_cfg.yml#L4submt_debug*.sh
wrapper script is not pushed to github remote but stored on $WORLDWORK/bip115/git/MDFF-EnTK
, local repository as it contains auth info. Early result of 8 replicas with 3iterations for 1.8A: https://github.com/radical-collaboration/MDFF-EnTK/tree/master/experiments/summit/simple_mdff_vds_8_replicas_1.8_restart_devel
$WORLDWORK/bip115/conda/rct
simple_mdff_vds.py
, branchingsimple_mdff.py
submit_debug.sh
cfg/workflow_cfg.yml
hasensemble_size
count to represent number of replicascfg/workflow_cfg.yml
hastotal_iter
to represent number of iterations