ESCOMP / CMEPS

NUOPC Community Mediator for Earth Prediction Systems
https://escomp.github.io/CMEPS/
22 stars 78 forks source link

Runtime hang when ESMF_AWARE_THREADING is on #483

Open nmizukami opened 2 months ago

nmizukami commented 2 months ago

Related to #482. Once two compiling error described in #482 locally, the model is successfully compiled. when try running, however, it hangs at very beginning of the initialization. I got PET error like this.

20240708 185822.851 ERROR            PET1290 esm.F90:1157 Not valid  -  Component ROF has mpi task mismatch, do threads align with nodes?
20240708 185822.855 ERROR            PET1290 esm.F90:203 Not valid  - Passing error in return code
20240708 185822.855 ERROR            PET1290 ESM0001:src/addon/NUOPC/src/NUOPC_Driver.F90:797 Not valid  - Passing error in return code
20240708 185822.855 ERROR            PET1290 ensemble:src/addon/NUOPC/src/NUOPC_Driver.F90:2901 Not valid  - Phase 'IPDv02p1' Initialize for modelComp 1: ESM0001 did not r
eturn ESMF_SUCCESS
20240708 185822.855 ERROR            PET1290 ensemble:src/addon/NUOPC/src/NUOPC_Driver.F90:1329 Not valid  - Passing error in return code
20240708 185822.855 ERROR            PET1290 ensemble:src/addon/NUOPC/src/NUOPC_Driver.F90:486 Not valid  - Passing error in return code
20240708 185822.855 ERROR            PET1290 esmApp.F90:134 Not valid  - Passing error in return code
20240708 185822.855 INFO             PET1290 Finalizing ESMF

This needs some esmf expert knowledge to look deeper into the problem.