NOAA-EMC / global-workflow

Global Superstructure/Workflow supporting the Global Forecast System (GFS)
https://global-workflow.readthedocs.io/en/latest
GNU Lesser General Public License v3.0
70 stars 162 forks

Update (partially) global-workflow for orion+rocky9 #2741

Open aerorahul opened 3 days ago

aerorahul commented 3 days ago

Description

This PR:

Type of change

Change characteristics

How has this been tested?

This PR alone is not sufficient for Orion. Because it updates submodules, it must also be tested on other platforms (Hera, WCOSS2).

Checklist

aerorahul commented 3 days ago

The walltimes for the *anal and eupd jobs likely need to be increased, at least temporarily, for Orion given that the GSI is running far less than optimally on the system. Based on @RussTreadon-NOAA's findings, I'd suggest doubling the requested walltimes for gdas/gfsanal to 2:40 and 2:00, respectively. The eupd job typically runs in much less than 30 minutes on most systems, so it may be fine and could be revisited later if need be.

Thanks for reminding me of this issue. I have added a config.resources.ORION that doubles the walltime for the gfs and gdas analysis tasks, with a block comment explaining why.
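For readers following along, here is a rough sketch of what such a host-specific override could look like. It is not the contents of the actual config.resources.ORION: the variable names (`step`, `wtime_anal_gdas`, `wtime_anal_gfs`) and the helper `double_walltime` are assumptions for illustration, and the baseline values 01:20:00 and 01:00:00 are only inferred from "doubling ... to 2:40 and 2:00" above.

```shell
#!/usr/bin/env bash
# Illustrative sketch only; names and baseline values are assumptions,
# not copied from the real config.resources.ORION.

# Double an HH:MM:SS walltime string and print the result as HH:MM:SS.
double_walltime() {
  printf '%s\n' "$1" | awk -F: '{
    t = 2 * ($1 * 3600 + $2 * 60 + $3)
    printf "%02d:%02d:%02d\n", int(t / 3600), int((t % 3600) / 60), t % 60
  }'
}

# In a real workflow the calling job would set this; fixed here for the sketch.
step="anal"

case "${step}" in
  anal)
    # The GSI is currently running far slower than expected on Orion.
    # Double the analysis walltimes as a temporary measure and revisit later.
    wtime_anal_gdas=$(double_walltime "01:20:00")
    wtime_anal_gfs=$(double_walltime "01:00:00")
    export wtime_anal_gdas wtime_anal_gfs
    ;;
  *)
    # Other steps keep their default resources.
    ;;
esac

echo "gdas analysis walltime: ${wtime_anal_gdas}"
echo "gfs analysis walltime:  ${wtime_anal_gfs}"
```

Keeping the override in a per-host file with a comment on the reason makes it easy to drop once GSI performance on Orion is sorted out.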

emcbot commented 3 days ago

CI Update on Wcoss2 at 07/01/24 05:12:08 PM
============================================
Cloning and Building global-workflow PR: 2741
with PID: 37144 on host: dlogin08
aerorahul commented 3 days ago

Ah, I see now you will want to update module_gwsetup.orion.lua and module_gwci.orion.lua to point to the new spack-stack.

Thanks @DavidHuber-NOAA. Updates to these modulefiles have been applied.

emcbot commented 3 days ago

Automated global-workflow Testing Results:


Machine: Wcoss2
Start: Mon Jul  1 17:14:58 UTC 2024 on dlogin08
---------------------------------------------------
Build: Completed at 07/01/24 05:51:20 PM
Case setup: Completed for experiment C48_ATM_e87db13e
Case setup: Skipped for experiment C48mx500_3DVarAOWCDA_e87db13e
Case setup: Skipped for experiment C48_S2SWA_gefs_e87db13e
Case setup: Completed for experiment C48_S2SW_e87db13e
Case setup: Completed for experiment C96_atm3DVar_extended_e87db13e
Case setup: Skipped for experiment C96_atm3DVar_e87db13e
Case setup: Completed for experiment C96_atmaerosnowDA_e87db13e
Case setup: Completed for experiment C96C48_hybatmDA_e87db13e
Case setup: Completed for experiment C96C48_ufs_hybatmDA_e87db13e
aerorahul commented 3 days ago

Looks good (again).

One additional note: since the UPP cannot be built as is, users will get an error from build_all.sh that they should be instructed to ignore. Alternatively, we could temporarily disable that build on Orion within build_upp.sh (with a printed note). In any case, the *anlupp jobs will be unable to run, preventing cycling.

Thanks @DavidHuber-NOAA. This PR is not sufficient for Orion to be up and usable again. @WenMeng-NOAA is aware.
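If the "temporarily disable that build on Orion (with a printed note)" option were taken, the guard might look roughly like the sketch below. This is not the actual build_upp.sh: the function names `maybe_build_upp` and `build_upp` and the way the machine name is passed in are hypothetical, and the real script may detect the platform differently.

```shell
#!/usr/bin/env bash
# Illustrative sketch only; function names and the machine argument are
# hypothetical and do not reflect the real build_upp.sh.

# Stand-in for the real UPP build step.
build_upp() {
  echo "building UPP"
}

# Skip the UPP build on Orion and tell the user why; build everywhere else.
maybe_build_upp() {
  machine="$1"
  if [ "${machine}" = "orion" ]; then
    echo "NOTE: skipping UPP build on Orion; it cannot currently be built there."
    echo "NOTE: the *anlupp jobs will not run until this is resolved."
    return 0
  fi
  build_upp
}

maybe_build_upp orion
maybe_build_upp hera
```

Returning success with a note avoids an error from build_all.sh that users would otherwise have to be told to ignore, while still making it clear that cycling will not work.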

emcbot commented 3 days ago

All CI Test Cases Passed on Wcoss2:


Experiment C48_ATM_e87db13e *** SUCCESS *** at 07/01/24 07:08:12 PM
Experiment C48_S2SW_e87db13e *** SUCCESS *** at 07/01/24 07:16:11 PM
Experiment C96C48_hybatmDA_e87db13e *** SUCCESS *** at 07/01/24 08:32:37 PM
Experiment C96_atmaerosnowDA_e87db13e *** SUCCESS *** at 07/01/24 09:08:29 PM
Experiment C96C48_ufs_hybatmDA_e87db13e *** SUCCESS *** at 07/01/24 09:56:25 PM
Experiment C96_atm3DVar_extended_e87db13e *** SUCCESS *** at 07/02/24 04:36:35 AM
emcbot commented 1 day ago

Experiment C48mx500_3DVarAOWCDA FAILED on Hera with error logs:

/scratch1/NCEPDEV/global/CI/2741/RUNTESTS/COMROOT/C48mx500_3DVarAOWCDA_c6962496/logs/2021032412/gdasfcst.log

Follow link here to view the contents of the above file(s): (link)

emcbot commented 1 day ago

Experiment C96C48_hybatmDA FAILED on Hera in /scratch1/NCEPDEV/global/CI/2741/RUNTESTS/C96C48_hybatmDA_c6962496

emcbot commented 1 day ago

Experiment C48mx500_3DVarAOWCDA FAILED on Hera in /scratch1/NCEPDEV/global/CI/2741/RUNTESTS/C48mx500_3DVarAOWCDA_c6962496

emcbot commented 1 day ago

Experiment C96_atm3DVar FAILED on Hera with error logs:

/scratch1/NCEPDEV/global/CI/2741/RUNTESTS/COMROOT/C96_atm3DVar_c6962496/logs/2021122018/gdasfcst.log

Follow link here to view the contents of the above file(s): (link)

emcbot commented 1 day ago

Experiment C48_ATM FAILED on Hera in /scratch1/NCEPDEV/global/CI/2741/RUNTESTS/C48_ATM_c6962496

emcbot commented 1 day ago

Experiment C48_S2SW FAILED on Hera in /scratch1/NCEPDEV/global/CI/2741/RUNTESTS/C48_S2SW_c6962496

emcbot commented 1 day ago

Experiment C96_atmaerosnowDA FAILED on Hera in /scratch1/NCEPDEV/global/CI/2741/RUNTESTS/C96_atmaerosnowDA_c6962496

emcbot commented 1 day ago

Experiment C96_atm3DVar FAILED on Hera in /scratch1/NCEPDEV/global/CI/2741/RUNTESTS/C96_atm3DVar_c6962496

emcbot commented 1 day ago

Experiment C48_S2SWA_gefs FAILED on Hera in /scratch1/NCEPDEV/global/CI/2741/RUNTESTS/C48_S2SWA_gefs_c6962496

aerorahul commented 1 day ago

@KateFriedman-NOAA Is prepobs 1.0.2 installed on Hera?