Open aerorahul opened 3 days ago
The walltimes for the *anal and eupd jobs likely need to be increased, at least temporarily, for Orion given that the GSI is running far less than optimally on the system. Based on @RussTreadon-NOAA's findings, I'd suggest doubling the requested walltimes for gdas/gfsanal to 2:40 and 2:00, respectively. The eupd job typically runs in much less than 30 minutes on most systems, so it may be fine and could be revisited later if need be.
Thanks for reminding me of this issue.
I have added a config.resources.ORION
increasing the walltime for gfs|gdas
analysis task, doubling the walltime with a block comment on the need.
CI Update on Wcoss2 at 07/01/24 05:12:08 PM
============================================
Cloning and Building global-workflow PR: 2741
with PID: 37144 on host: dlogin08
Ah, I see now you will want to update module_gwsetup.orion.ua and module_gwci.orion.lua to point to the new spack-stack.
Thanks @DavidHuber-NOAA updates to these modulefiles have been applied.
Automated global-workflow Testing Results:
Machine: Wcoss2
Start: Mon Jul 1 17:14:58 UTC 2024 on dlogin08
---------------------------------------------------
Build: Completed at 07/01/24 05:51:20 PM
Case setup: Completed for experiment C48_ATM_e87db13e
Case setup: Skipped for experiment C48mx500_3DVarAOWCDA_e87db13e
Case setup: Skipped for experiment C48_S2SWA_gefs_e87db13e
Case setup: Completed for experiment C48_S2SW_e87db13e
Case setup: Completed for experiment C96_atm3DVar_extended_e87db13e
Case setup: Skipped for experiment C96_atm3DVar_e87db13e
Case setup: Completed for experiment C96_atmaerosnowDA_e87db13e
Case setup: Completed for experiment C96C48_hybatmDA_e87db13e
Case setup: Completed for experiment C96C48_ufs_hybatmDA_e87db13e
Looks good (again).
One additional note: since the UPP cannot be built as is, users will get an error from
build_all.sh
that they should be instructed to ignore. Alternatively, we could temporarily disable that build on Orion within build_upp.sh (with a printed note). In any case, the*anlupp
jobs will be unable to run, preventing cycling.
Thanks @DavidHuber-NOAA This PR is not sufficient for Orion to be up and usable again. @WenMeng-NOAA is aware.
All CI Test Cases Passed on Wcoss2:
Experiment C48_ATM_e87db13e *** SUCCESS *** at 07/01/24 07:08:12 PM
Experiment C48_S2SW_e87db13e *** SUCCESS *** at 07/01/24 07:16:11 PM
Experiment C96C48_hybatmDA_e87db13e *** SUCCESS *** at 07/01/24 08:32:37 PM
Experiment C96_atmaerosnowDA_e87db13e *** SUCCESS *** at 07/01/24 09:08:29 PM
Experiment C96C48_ufs_hybatmDA_e87db13e *** SUCCESS *** at 07/01/24 09:56:25 PM
Experiment C96_atm3DVar_extended_e87db13e *** SUCCESS *** at 07/02/24 04:36:35 AM
Experiment C48mx500_3DVarAOWCDA FAILED on Hera with error logs:
/scratch1/NCEPDEV/global/CI/2741/RUNTESTS/COMROOT/C48mx500_3DVarAOWCDA_c6962496/logs/2021032412/gdasfcst.log
Follow link here to view the contents of the above file(s): (link)
Experiment C96C48_hybatmDA FAILED on Hera in
/scratch1/NCEPDEV/global/CI/2741/RUNTESTS/C96C48_hybatmDA_c6962496
Experiment C48mx500_3DVarAOWCDA FAILED on Hera in
/scratch1/NCEPDEV/global/CI/2741/RUNTESTS/C48mx500_3DVarAOWCDA_c6962496
Experiment C96_atm3DVar FAILED on Hera with error logs:
/scratch1/NCEPDEV/global/CI/2741/RUNTESTS/COMROOT/C96_atm3DVar_c6962496/logs/2021122018/gdasfcst.log
Follow link here to view the contents of the above file(s): (link)
Experiment C48_ATM FAILED on Hera in
/scratch1/NCEPDEV/global/CI/2741/RUNTESTS/C48_ATM_c6962496
Experiment C48_S2SW FAILED on Hera in
/scratch1/NCEPDEV/global/CI/2741/RUNTESTS/C48_S2SW_c6962496
Experiment C96_atmaerosnowDA FAILED on Hera in
/scratch1/NCEPDEV/global/CI/2741/RUNTESTS/C96_atmaerosnowDA_c6962496
Experiment C96_atm3DVar FAILED on Hera in
/scratch1/NCEPDEV/global/CI/2741/RUNTESTS/C96_atm3DVar_c6962496
Experiment C48_S2SWA_gefs FAILED on Hera in
/scratch1/NCEPDEV/global/CI/2741/RUNTESTS/C48_S2SWA_gefs_c6962496
@KateFriedman-NOAA Is prepobs 1.0.2 installed on Hera?
Description
This PR:
gwsetup
andgwci
as well. (We need to find a better way that spreading this information in 3 different modulefiles)parm/config.gfs/config.resources.ORION
to address GSI performance degradation after Rocky 9 upgrade.This PR:
prepobs
andfit2obs
updated locations when they are installed inglopara
space on Orion.Type of change
Change characteristics
How has this been tested?
This PR is not sufficient for Orion. This PR must be tested on other platforms (Hera, WCOSS2) as this PR updates submodules.
Checklist