The following program appears to OOM (killed by the job scheduler) when with a process per core on a 40 node machine, and errors out with a memory corruption bug with 2 processes (running with only 1 process raises a warning that the local size of a tensor is larger than INT_MAX) when run on the arabic-2005 matrix.
The following program appears to OOM (killed by the job scheduler) when with a process per core on a 40 node machine, and errors out with a memory corruption bug with 2 processes (running with only 1 process raises a warning that the local size of a tensor is larger than INT_MAX) when run on the arabic-2005 matrix.