SystemsGenetics / KINC

Knowledge Independent Network Construction
MIT License

MPI problems for corrpower analytic #134

Closed spficklin closed 4 years ago

spficklin commented 4 years ago

When I run the corrpower analytic using mpiexec I get the following error message:

../../src/core/pairwise_matrix.cpp:175
void Pairwise::Matrix::write(const Pairwise::Index&, qint8)
PAIRWISE MATRIX LOGICAL ERROR
Attempting to write indent 192 when last written is 18285760.
-------------------------------------------------------
Primary job  terminated normally, but 1 process returned
a non-zero exit code.. Per user-direction, the job has been aborted.
-------------------------------------------------------
--------------------------------------------------------------------------
mpiexec detected that one or more processes exited with non-zero status, thus causing
the job to be terminated. The first process to do so was:

  Process name: [[39414,1],0]
  Exit code:    255
--------------------------------------------------------------------------

I have double-checked that I'm giving the proper workblock index so that the workblocks come back in order, but I can't find the problem. Help would be greatly appreciated. The problem is reproducible on my machine using the code in PR #132.
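For anyone reading this later with a similar error: the message suggests the writer rejects an index (192) that is lower than the last one written (18285760), which is what happens when results from a later work block are written before those of an earlier one. The sketch below is not KINC or ACE code, and it is not the fix from the commit referenced in this thread. It only illustrates one common way to restore ordering when blocks finish out of order under MPI, assuming blocks are numbered from 0 and each block's indices are already sorted. `OrderedSink`, `BlockResult`, and `ReorderBuffer` are hypothetical names invented for the illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <map>
#include <stdexcept>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in for an append-only output that rejects any index
// that is not strictly greater than the last one written.
class OrderedSink {
public:
    void write(std::int64_t index) {
        if (index <= last_) {
            throw std::logic_error("Attempting to write index " + std::to_string(index) +
                                   " when last written is " + std::to_string(last_) + ".");
        }
        last_ = index;
        written_.push_back(index);
    }
    const std::vector<std::int64_t>& written() const { return written_; }

private:
    std::int64_t last_ = -1;
    std::vector<std::int64_t> written_;
};

// Hypothetical result of one work block: its number and the sorted indices it produced.
struct BlockResult {
    int block;
    std::vector<std::int64_t> indices;
};

// Holds results that arrive early and releases them to the sink in block order only.
class ReorderBuffer {
public:
    explicit ReorderBuffer(OrderedSink& sink) : sink_(sink) {}

    void accept(BlockResult result) {
        assert(result.block >= next_ && "block was already flushed");
        const int block = result.block;
        pending_.emplace(block, std::move(result));
        // Flush every consecutive block that is now available, starting at next_.
        for (auto it = pending_.find(next_); it != pending_.end(); it = pending_.find(next_)) {
            for (std::int64_t index : it->second.indices) {
                sink_.write(index);
            }
            pending_.erase(it);
            ++next_;
        }
    }

    std::size_t pendingCount() const { return pending_.size(); }

private:
    OrderedSink& sink_;
    std::map<int, BlockResult> pending_;  // early arrivals, keyed by block number
    int next_ = 0;                        // next block number allowed to be written
};
```

With this arrangement, a result for block 1 that arrives before block 0 is held back rather than written, so the sink never sees a lower index after a higher one.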

4ctrl-alt-del commented 4 years ago

Fixed in commit 15237e37d335a1175e297e70803d231b1c188442. Let me know if this fix works for you, @spficklin, and I will let you close the issue.