Closed pwolfram closed 6 years ago
@mark-petersen, did this get fixed? This may be related to the memory leak I'm observing locally.
I'm getting a similar issue with the Delaware wetting / drying test case:
App launch reported: 1 (out of 1) daemons - 36 (out of 36) procs
Insufficient memory to allocate Fortran RTL message buffer, message #41 = hex 00000029.
-------------------------------------------------------
Primary job terminated normally, but 1 process returned
a non-zero exit code.. Per user-direction, the job has been aborted.
-------------------------------------------------------
--------------------------------------------------------------------------
mpirun detected that one or more processes exited with non-zero status, thus causing
the job to be terminated. The first process to do so was:
Process name: [[61678,1],1]
Exit code: 41
--------------------------------------------------------------------------
We fixed a memory leak just after that. Of course, I don't know if it is related to the problem you are seeing now. The fix PRs are https://github.com/MPAS-Dev/MPAS/pull/1501 https://github.com/MPAS-Dev/MPAS/pull/1502 https://github.com/MPAS-Dev/MPAS/pull/1515 But they are not worth reading in any detail.
Thanks @mark-petersen, this is great. There may be another issue that has cropped up but the key issue highlighted here was resolved so I'm going to close this issue.
The current version of
ocean/develop
fails witherrors when using the default SOMA test cases across resolutions (4, 8, 16, 32) targeting between 200-300 cells/processor.
This is on grizzly using
make ifort CORE=ocean AUTOCLEAN=true DEBUG=false
withmpirun
.The issue appears to be related to the commit
Testing again (in consultation with @mark-petersen) using hash:
has not yet yielded any errors and appears to be working, suggesting a possible bug (as identified by @mark-petersen) in
1cd59184880c14dfc9b80292453d37f66fa792d6
.