======================== JOB MAP ========================
Data for JOB prterun-farm22-gpu0103-226592@1 offset 0 Total slots allocated 128
Mapping policy: BYSLOT:NOOVERSUBSCRIBE Ranking policy: SLOT Binding policy: PACKAGE
Cpu set: N/A PPR: N/A Cpus-per-rank: N/A Cpu Type: CORE
Data for node: farm22-gpu0103 Num slots: 2 Max slots: 0 Num procs: 2
Process jobid: prterun-farm22-gpu0103-226592@1 App: 0 Process rank: 0 Bound: package[0][core:0-15]
Process jobid: prterun-farm22-gpu0103-226592@1 App: 0 Process rank: 1 Bound: package[0][core:0-15]
Data for node: farm22-gpu0104 Num slots: 2 Max slots: 0 Num procs: 2
Process jobid: prterun-farm22-gpu0103-226592@1 App: 0 Process rank: 2 Bound: package[0][core:0-15]
Process jobid: prterun-farm22-gpu0103-226592@1 App: 0 Process rank: 3 Bound: package[0][core:0-15]
=============================================================
but the process is stopped with the following error
The rankfile that was used claimed that a host was either not
allocated or oversubscribed its slots. Please review your rank-slot
assignments and your host allocation to ensure a proper match. Also,
some systems may require using full hostnames, such as
"host1.example.com" (instead of just plain "host1").
I am using the following mpi:
MPI_HOST_STRING and hostfile
the mapping looks correct: