Closed amckinstry closed 3 years ago
Just to be clear: are you saying that OMPI v4.1.0 passes these tests if configured with PMIx v3.2.2, but fails if configured with PMIx v4.0.0? There was a bug in generalized requests that has been fixed on the branch, but that had nothing to do with PMIx (AFAIK).
A quick test shows that building 4.1.0 against PMIx 3.2.2 doesn't fix it.
So something has changed between 4.0.5 and 4.1.0 (or in our related config).
This was fixed in #8340 (nothing to do with PMIx).
To clarify: this was reported in #8340 and fixed in the v4.1.x branch in #8348.
This solves the issue for us.
@amckinstry Excellent. FYI: we only have one or two more things before we plan to roll a v4.1.1rc. Should be "Real Soon Now".
This is on Debian unstable, with Open MPI 4.1.0, in the mpi4py (3.0.3) test suite.
The errors we're seeing are here: https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=979480
The tests work with openmpi 4.0.5.
In both cases the /etc/openmpi/openmpi-mca-params.conf has:
to allow testing on single-node systems (this is with oversubscription enabled).
The failing case uses PMIx 4.0.0; the working case uses 3.2.2.
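For anyone comparing the MCA parameter file between a working and a failing setup, a minimal sketch of reading a file in the `name = value` format used by `openmpi-mca-params.conf` might look like the following. The helper name `parse_mca_params` and the sample contents are hypothetical; in particular, `rmaps_base_oversubscribe = 1` is only an assumed example of an oversubscription setting, not the lines from this report.

```python
def parse_mca_params(text):
    """Parse 'name = value' lines, ignoring blank lines and '#' comments."""
    params = {}
    for raw in text.splitlines():
        # Drop anything after a '#' and surrounding whitespace.
        line = raw.split("#", 1)[0].strip()
        if not line or "=" not in line:
            continue
        name, value = line.split("=", 1)
        params[name.strip()] = value.strip()
    return params


# Hypothetical file contents, used for illustration only.
sample = """
# allow more ranks than cores on a single node (assumed parameter)
rmaps_base_oversubscribe = 1
"""

params = parse_mca_params(sample)
print(params.get("rmaps_base_oversubscribe"))
```

Running the same parser over the file from each environment and diffing the resulting dictionaries is one way to rule out a configuration difference before suspecting the library version.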