ucgmsim / slurm_gm_workflow

Porting the GM workflow to run on new NeSI HPC (Maintainer: Jonney)
MIT License
0 stars 2 forks source link

Skip e3d.par when moving tmp files #526

Closed sungeunbae closed 2 months ago

sungeunbae commented 2 months ago

When EMOD3D fails, the slurm/PBS script relocate incomplete output files to a temporary directory for later debug. However, it also moves e3d.par, which will make the restart attempt futile as it will fail instantly if e3d.par is missing.

This change hopefully fixes this issue

sungeunbae commented 2 months ago

Is this able to be tested to "fix" this issue? Just to be sure there wouldn't need to be a 2nd PR for a fix

Sure - I will force it to fail and test this fix.

sungeunbae commented 2 months ago

Is this able to be tested to "fix" this issue? Just to be sure there wouldn't need to be a 2nd PR for a fix

Sure - I will force it to fail and test this fix.

@joelridden On NeSI, /nesi/nobackup/nesi00213/RunFolder/Cybershake/v24p9/Runs/HikWgtnmax/HikWgtnmax, the EMOD3D job failed due to out of memory, but e3d.par file stayed - the fix works as intended.