ufs-community / ufs-srweather-app

UFS Short-Range Weather Application

[develop] Fix crontab bug for Cheyenne and Derecho, update PR template for new platforms #934

Closed mkavulich closed 1 year ago

mkavulich commented 1 year ago

DESCRIPTION OF CHANGES:

Creating an experiment with USE_CRON_TO_RELAUNCH=True is currently broken on Cheyenne and Derecho due to faulty Python logic. This PR fixes that issue.

I also took the opportunity to update the PR template to include the newly supported platforms (Derecho, Hercules, and Gaea C5).

Type of change

TESTS CONDUCTED:

Ran the WE2E fundamental tests with the option --launch=cron on three platforms. These tests previously failed on Cheyenne and Derecho; they now all succeed except for the grid_RRFS_CONUS_25km_ics_NAM_lbcs_NAM_suite_GFS_v16 test on Cheyenne, which is a pre-existing failure (see Issue #933).

DEPENDENCIES:

None

DOCUMENTATION:

None

ISSUE:

Fixes #932

CHECKLIST

MichaelLueken commented 1 year ago

@mkavulich - I wanted to let you know that the SRW v2.2 release branch, release/public-v2.2.0, was created last Friday. This update should be included in the release, so please open another PR that makes these changes, at least to ush/get_crontab_contents.py, in the release/public-v2.2.0 branch. Thanks!

MichaelLueken commented 1 year ago

The Hera GNU tests failed in the Functional Workflow Task Tests. Relaunching the tests on that machine now.

MichaelLueken commented 1 year ago

The WE2E coverage tests were run on Derecho and all passed:

----------------------------------------------------------------------------------------------------
Experiment name                                                  | Status    | Core hours used 
----------------------------------------------------------------------------------------------------
custom_ESGgrid_IndianOcean_6km                                     COMPLETE              21.55
grid_RRFS_CONUS_13km_ics_FV3GFS_lbcs_FV3GFS_suite_GFS_v16_plot     COMPLETE              34.70
grid_RRFS_CONUS_25km_ics_NAM_lbcs_NAM_suite_GFS_v16                COMPLETE              42.16
grid_RRFS_CONUScompact_13km_ics_HRRR_lbcs_RAP_suite_HRRR           COMPLETE              26.85
grid_RRFS_CONUScompact_25km_ics_HRRR_lbcs_RAP_suite_RRFS_v1beta    COMPLETE              16.51
grid_SUBCONUS_Ind_3km_ics_HRRR_lbcs_HRRR_suite_HRRR                COMPLETE              38.01
nco_grid_RRFS_CONUS_25km_ics_FV3GFS_lbcs_FV3GFS_timeoffset_suite_  COMPLETE              22.55
pregen_grid_orog_sfc_climo                                         COMPLETE              12.77
specify_template_filenames                                         COMPLETE              13.20
----------------------------------------------------------------------------------------------------
Total                                                              COMPLETE             228.30
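The reported total can be cross-checked by summing the per-experiment core hours. The following is only a minimal sketch, assuming the summary rows are pasted into a string; the names `TABLE`, `ROW`, and `parse_rows` are illustrative and are not part of the SRW App or its WE2E scripts.

```python
import re

# Data rows copied verbatim from the Derecho summary above
# (one experiment name is truncated in the original output and is kept as-is).
TABLE = """\
custom_ESGgrid_IndianOcean_6km                                     COMPLETE              21.55
grid_RRFS_CONUS_13km_ics_FV3GFS_lbcs_FV3GFS_suite_GFS_v16_plot     COMPLETE              34.70
grid_RRFS_CONUS_25km_ics_NAM_lbcs_NAM_suite_GFS_v16                COMPLETE              42.16
grid_RRFS_CONUScompact_13km_ics_HRRR_lbcs_RAP_suite_HRRR           COMPLETE              26.85
grid_RRFS_CONUScompact_25km_ics_HRRR_lbcs_RAP_suite_RRFS_v1beta    COMPLETE              16.51
grid_SUBCONUS_Ind_3km_ics_HRRR_lbcs_HRRR_suite_HRRR                COMPLETE              38.01
nco_grid_RRFS_CONUS_25km_ics_FV3GFS_lbcs_FV3GFS_timeoffset_suite_  COMPLETE              22.55
pregen_grid_orog_sfc_climo                                         COMPLETE              12.77
specify_template_filenames                                         COMPLETE              13.20
Total                                                              COMPLETE             228.30
"""

# A row is: name, status, core hours (hypothetical pattern for this layout).
ROW = re.compile(r"^(\S+)\s+(\S+)\s+(\d+\.\d+)\s*$")


def parse_rows(text):
    """Return (name, status, core_hours) tuples for every matching row."""
    rows = []
    for line in text.splitlines():
        match = ROW.match(line)
        if match:
            name, status, hours = match.groups()
            rows.append((name, status, float(hours)))
    return rows


rows = parse_rows(TABLE)
experiments = [r for r in rows if r[0] != "Total"]
reported_total = next(r[2] for r in rows if r[0] == "Total")
# Round to two decimals to absorb floating-point noise in the sum.
computed_total = round(sum(r[2] for r in experiments), 2)

print(f"{len(experiments)} experiments, "
      f"computed {computed_total:.2f}, reported {reported_total:.2f}")
```

For the Derecho rows this prints `9 experiments, computed 228.30, reported 228.30`, matching the Total line.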

Additionally, while the Jenkins pipeline doesn't show the status of the Test stage for Gaea, all tests ran and passed:

----------------------------------------------------------------------------------------------------
Experiment name                                                  | Status    | Core hours used 
----------------------------------------------------------------------------------------------------
community                                                          COMPLETE              32.58
custom_ESGgrid_NewZealand_3km                                      COMPLETE              82.04
grid_RRFS_CONUScompact_13km_ics_HRRR_lbcs_RAP_suite_RRFS_v1beta    COMPLETE              40.09
grid_RRFS_CONUS_13km_ics_FV3GFS_lbcs_FV3GFS_suite_RAP              COMPLETE              48.88
grid_RRFS_CONUS_13km_ics_FV3GFS_lbcs_FV3GFS_suite_HRRR             COMPLETE              41.41
grid_RRFS_CONUS_3km_ics_FV3GFS_lbcs_FV3GFS_suite_GFS_v15_thompson  COMPLETE             432.25
grid_RRFS_CONUScompact_25km_ics_HRRR_lbcs_HRRR_suite_HRRR          COMPLETE              51.55
grid_RRFS_CONUScompact_3km_ics_HRRR_lbcs_RAP_suite_RRFS_v1beta     COMPLETE             391.29
grid_SUBCONUS_Ind_3km_ics_RAP_lbcs_RAP_suite_RRFS_v1beta_plot      COMPLETE              15.88
nco_ensemble                                                       COMPLETE             122.72
nco_grid_RRFS_CONUS_3km_ics_FV3GFS_lbcs_FV3GFS_suite_GFS_v15_thom  COMPLETE             414.46
----------------------------------------------------------------------------------------------------
Total                                                              COMPLETE            1673.15

Once the Hera GNU tests successfully complete, I will be able to move forward and merge this work.

MichaelLueken commented 1 year ago

The Jenkins tests continued to fail on Hera GNU in the Functional Workflow Task Tests stage: 30 minutes isn't enough time to run the community test using GNU-compiled executables. Fixes are added in PRs #935 and #936 for develop and release/public-v2.2.0, respectively. The coverage tests were run manually on Hera GNU and passed:

----------------------------------------------------------------------------------------------------
Experiment name                                                  | Status    | Core hours used 
----------------------------------------------------------------------------------------------------
custom_ESGgrid_Peru_12km                                           COMPLETE              31.48
get_from_HPSS_ics_FV3GFS_lbcs_FV3GFS_fmt_nemsio_2019061200         COMPLETE              18.88
get_from_NOMADS_ics_FV3GFS_lbcs_FV3GFS                             COMPLETE              33.88
grid_RRFS_CONUS_13km_ics_FV3GFS_lbcs_FV3GFS_suite_HRRR             COMPLETE             233.42
grid_RRFS_CONUS_25km_ics_FV3GFS_lbcs_FV3GFS_suite_RRFS_v1beta      COMPLETE              36.70
grid_SUBCONUS_Ind_3km_ics_HRRR_lbcs_RAP_suite_WoFS_v0              COMPLETE              23.34
long_fcst                                                          COMPLETE              81.91
MET_verification_only_vx                                           COMPLETE               0.12
MET_ensemble_verification_only_vx_time_lag                         COMPLETE               7.74
nco_grid_RRFS_CONUS_13km_ics_FV3GFS_lbcs_FV3GFS_suite_GFS_v16      COMPLETE             341.16
----------------------------------------------------------------------------------------------------
Total                                                              COMPLETE             808.63

Merging this PR now.