Open A-CGray opened 6 months ago
When trying to run the adflow tests on the public:u22-intel-impi-latest-amd64
image from https://github.com/mdolab/docker/pull/266 on my machine I get the following errors on many of the tests, any idea what's going on here @eirikurj ?
(mpi) ./tests/reg_tests/test_functionals.py:TestFunctionals_2_euler_matrix_jst_tut_wing.test_forces_and_tractions ... FAIL (00:00:0.00, 0 MB)
===================================================================================
= BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
= RANK 0 PID 3826 RUNNING AT 9dee14de4df5
= KILLED BY SIGNAL: 7 (Bus error)
===================================================================================
===================================================================================
= BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
= RANK 1 PID 3827 RUNNING AT 9dee14de4df5
= KILLED BY SIGNAL: 9 (Killed)
===================================================================================
(mpi) ./tests/reg_tests/test_functionals.py:TestFunctionals_2_euler_matrix_jst_tut_wing.test_functions ... FAIL (00:00:0.00, 0 MB)
===================================================================================
= BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
= RANK 0 PID 3837 RUNNING AT 9dee14de4df5
= KILLED BY SIGNAL: 7 (Bus error)
===================================================================================
===================================================================================
= BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
= RANK 1 PID 3838 RUNNING AT 9dee14de4df5
= KILLED BY SIGNAL: 9 (Killed)
===================================================================================
You probably need to increase the shared memory size. You can add a flag when starting the container, docker run --shm-size=XX
. The default is 65MB, but you can increase it significantly, e.g., for 2GB add --shm-size=2G
. This is probably too big in general (something like O(100) MB, e.g., 256MB is probably sufficient), but should be fine since we should have plenty of RAM and not too many containers running, but you can experiment. If you dont want to bother with per-container settings, then you can add the following to /etc/docker/daemon.json
{
"default-shm-size": "2G"
}
but you might want to keep this smaller then. See if this resolves your immediate problem.
Description
A handful of the complex ADflow regression tests are failing on the latest docker PR that uses the new intel
ifx
andmpiifx
compilers. Most likely we need to re-train the tests.Current behavior
Expected behavior
Tests should pass
Code versions