Closed boegel closed 8 months ago
LGTM, very minor remarks.
Don't merge this yet please, we should actually test first that the workaround implemented here fixes the issues...
Don't merge this yet please, we should actually test first that the workaround implemented here fixes the issues...
ok
@hajgato is testing this currently, seems to work as designed for me with a quick test (MPI hello on top of OpenMPI)
@hajgato is testing this currently, seems to work as designed for me with a quick test (MPI hello on top of OpenMPI)
so this can be merged?
@wdpypere Yes, tests were passed on Tier1
Motivation for this are the inconsistent errors "
Failed to modify UD QP to INIT on mlx5_0: Operation not permitted
" that we have been seeing after updating to OFED 23.10.Worth noting, same can be achieved in contexts where
mympirun
is not used via: