NVIDIA / modulus-sym

Framework providing Pythonic APIs, algorithms, and utilities to be used with Modulus core for physics-informed model training, as well as higher-level abstractions for domain experts
https://developer.nvidia.com/modulus
Apache License 2.0

🐛[BUG]: Experiment manager currently stops at the first failure interrupting QA tests #55

Closed: ktangsali closed this issue 11 months ago

ktangsali commented 11 months ago

Version

1.1.0

On which installation method(s) does this occur?

Docker, Pip, Source

Describe the issue

The run_experiments.py script exits as soon as training one of the sym examples fails. We want to avoid this and carry on training all the remaining examples listed in the experiments.json file. Adding exit_on_fail=False to https://github.com/NVIDIA/modulus-sym/blob/main/examples/test/run_experiments.py#L87 helps.
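
To illustrate the behavior being asked for, here is a minimal sketch of a run loop that keeps going after a failure. It is not the code in run_experiments.py: the function name run_all and its command-list input are hypothetical, and only the flag name exit_on_fail is taken from this issue. It assumes each example can be launched as a subprocess command.

```python
import subprocess
import sys


def run_all(commands, exit_on_fail=False):
    """Run each command in order and return the (command, returncode) failures.

    Hypothetical helper for illustration; only the exit_on_fail flag name
    comes from the issue text.
    """
    failures = []
    for cmd in commands:
        result = subprocess.run(cmd)
        if result.returncode != 0:
            # Record the failure instead of aborting the whole run.
            failures.append((cmd, result.returncode))
            if exit_on_fail:
                break  # old behavior: stop at the first failing example
    return failures


if __name__ == "__main__":
    # Stand-in commands: one that fails, one that succeeds.
    bad = [sys.executable, "-c", "import sys; sys.exit(3)"]
    ok = [sys.executable, "-c", "pass"]
    failed = run_all([bad, ok], exit_on_fail=False)
    print(f"{len(failed)} of 2 commands failed")
```

With exit_on_fail=False every example is attempted and the failures are reported together at the end, which is what the QA run needs.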

Minimum reproducible example

No response

Relevant log output

No response

Environment details

No response

Other/Misc.

No response