intel / mpi-benchmarks

Livelock in case of memory allocation failure #47

Closed EmmanuelBRELLE closed 1 month ago

EmmanuelBRELLE commented 1 year ago

In some cases (for example, "IMB-MPI1 Pingpong" running with 3 or more processes), some processes are put aside and wait for the others in an MPI_Barrier. If the other processes meanwhile fail to allocate memory (message lengths too large for the "-mem" argument), they never reach the MPI_Barrier call. The "inactive" processes then hang forever in MPI_Barrier on MPI_COMM_WORLD.
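
The hang described above can be illustrated with a small stand-alone simulation. This is only a sketch, not IMB code: Python threads stand in for MPI ranks, `threading.Barrier` stands in for MPI_Barrier, and the function name `simulate` and its parameters are hypothetical. A timeout is added so the example terminates; a real MPI_Barrier has no timeout, so the idle rank would wait forever.

```python
import threading

def simulate(n_ranks=3, n_active=2, alloc_ok=False, timeout=0.5):
    """Simulate ranks meeting at a barrier; return the outcome per rank.

    Ranks below n_active play the "active" processes that must allocate
    message buffers; the remaining ranks are idle and go straight to the
    barrier.
    """
    barrier = threading.Barrier(n_ranks)
    outcome = {}

    def rank(r):
        if r < n_active and not alloc_ok:
            # Allocation failed: this rank bails out and never reaches
            # the barrier, which is the situation described in the report.
            outcome[r] = "alloc_failed"
            return
        try:
            barrier.wait(timeout=timeout)
            outcome[r] = "passed"
        except threading.BrokenBarrierError:
            # The timeout expired because not every rank arrived.
            # With a real MPI_Barrier this rank would block indefinitely.
            outcome[r] = "stuck"

    threads = [threading.Thread(target=rank, args=(r,)) for r in range(n_ranks)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return outcome

if __name__ == "__main__":
    print(simulate(alloc_ok=False))  # idle rank 2 ends up "stuck"
    print(simulate(alloc_ok=True))   # all ranks pass the barrier
```

With `alloc_ok=False`, ranks 0 and 1 return early and rank 2 is left alone at the barrier. With `alloc_ok=True`, all three ranks arrive and the barrier completes.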

JuliaRS commented 5 months ago

@EmmanuelBRELLE, please run IMB with the -mem 999 argument.

JuliaRS commented 1 month ago

@EmmanuelBRELLE, is this still an issue for you?

EmmanuelBRELLE commented 1 month ago

No, that's fine for me. Sorry for the late feedback.