Closed yunfangsun closed 1 month ago
It might be a memory issue, can you try to use half of cores/node and redistributes the node memory on less cores? Try a smaller set up and see if it persists.
Hi @aliabdolali ,
My original configuration is using 4500 processors, and I have reduced it to 2500, 1500, and 500 processors. they didn't work out.
And I also increased #SBATCH --mem 9000
, but it didn't help either.
And now, I will try WW3 on Hercules for a small domain case, to see if it could work.
@yunfangsun I was able to run ww3_ufs1.1 for version 8b5e91 without any issue. Please check my set up on Hercules here and let me know if you need help with anything. /work/noaa/marine/sbani/UFS_COASTAL/test_04302024/WW3/regtests/ww3_ufs1.1
@sbanihash Thank you very much for the testing, could you please change permission of the folder of /work/noaa/marine/sbani/UFS_COASTAL/test_04302024/WW3/regtests/ww3_ufs1.1 /work/noaa/marine/sbani/UFS_COASTAL/test_04302024/WW3/regtests/matrix13
I can't get access to it, due to Permission denied
Thank you again!
@yunfangsun can you remind me where we left this issue?
With the updated version of WW3, I can run WW3 without error messages on Hercules.
thx @yunfangsun !
Hi @aliabdolali @Sbanihash,
I met a MPI error when I run the WW3 on Hercules:
Previously, I used WW3 with version 520900e, the model could run smoothly both on Hera and Hercules.
And now I am using exactly the same settings, and updated to WW3 8b5e91f, there is an MPI error occurred on Hercules:
The ww3_shel.out shows:
the error log shows:
And I repeat the same update of WW3 on Hera, there is no such error message, and the model could run. And this error only happens on Hercules.
Could you please let me know if it was caused the configuration or library setting on Hercules?
Thank you very much!
Yunfang