cea-hpc / wi4mpi

Wrapper interface for MPI
BSD 3-Clause "New" or "Revised" License
80 stars 15 forks source link

Sw4lite: PMPI_Sendrecv(119): Invalid rank has value -2 but must be nonnegative and less than 12 #63

Closed GuillaumeLscrt closed 8 months ago

GuillaumeLscrt commented 9 months ago

The "PMPI_Sendrecv(119): Invalid rank has value -2 but must be nonnegative and less than 12" error has been produced with openmpi-4.1.5 and intelmpi-21.4.0, using the preload mode of wi4mpi.

#SBATCH --ntasks=12
[...]

export OMP_NUM_TRHEADS=4
srun sw4lite Sw4liteBuild/sw4lite/tests/pointsource/ps2.in
Abort(739891974) on node 8 (rank 8 in comm 0): Fatal error in PMPI_Sendrecv: Invalid rank, error stack:
PMPI_Sendrecv(260): MPI_Sendrecv(sbuf=0x14ec09f51970, scount=1, dtype=USER<vector>, dest=MPI_ANY_SOURCE, stag=345, rbuf=0x14ec09f51010, rcount
=1, dtype=USER<vector>, src=4, rtag=345, comm=0x84000002, status=0x7ffc6cf58160) failed
PMPI_Sendrecv(119): Invalid rank has value -2 but must be nonnegative and less than 12

There is also this warning:

sw4lite: Symbol `ompi_mpi_comm_null' has different size in shared object, consider re-linking
kevin-juilly commented 9 months ago

I provided a pull request for the main issue. I'm opening a new issue about the warning and my observation.