E3SM-Project / zppy

E3SM post-processing toolchain
BSD 3-Clause "New" or "Revised" License

Revise ncclimo default invocation #9

Open chengzhuzhang opened 3 years ago

chengzhuzhang commented 3 years ago

zppy currently invokes ncclimo in MPI mode and requests 4 nodes for climo jobs. Based on email exchanges with @czender: for all low-resolution simulations, using the default background mode instead could reduce queue time and save resources. MPI mode can be useful for HR simulations, which can either be assigned multiple nodes in MPI mode or run in background mode by ratcheting down the number of concurrent processes (to relieve RAM pressure) with the --job_nbr option.
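To make the proposal concrete, here is a rough sketch of the selection logic in Python. The helper name `ncclimo_parallel_args` and its parameters are hypothetical and are not part of zppy. The `--job_nbr` option comes from the discussion above; the `--par_typ=mpi` spelling is my assumption of how MPI mode is requested and should be checked against the ncclimo documentation.

```python
def ncclimo_parallel_args(high_res, use_mpi=False, job_nbr=None):
    """Return the extra ncclimo arguments that control parallelism.

    Hypothetical helper for illustration only; not zppy code.
    """
    args = []
    if high_res and use_mpi:
        # Assumed flag spelling for MPI mode; verify against the ncclimo docs.
        args.append("--par_typ=mpi")
    # Otherwise no parallelism flag is added, so ncclimo stays in its
    # default background mode.
    if job_nbr is not None:
        # Cap the number of concurrent processes to relieve RAM pressure.
        args.append(f"--job_nbr={job_nbr}")
    return args


# Low resolution: rely on the default background mode.
lr_args = ncclimo_parallel_args(high_res=False)
# High resolution, multiple nodes: MPI mode.
hr_mpi_args = ncclimo_parallel_args(high_res=True, use_mpi=True)
# High resolution, single node: background mode with fewer concurrent jobs.
hr_bck_args = ncclimo_parallel_args(high_res=True, job_nbr=4)
```

The value 4 for `job_nbr` is only a placeholder; a suitable number would depend on the memory available per node.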

tangq commented 3 years ago

Do you encounter problems with the current zppy climo default?

chengzhuzhang commented 3 years ago

I have experienced long queue times for jobs on Cori. At today's LR+RRM finalization meeting, @xuezhengllnl also mentioned long queue times for post-processing jobs. I thought revising the default ncclimo invocation for climo would alleviate this problem as well.

tangq commented 3 years ago

Did you use the debug queue? Climos normally don't take long to compute and can fit in the Cori debug queue.

You can try, but I don't see an easy, optimal way, as it depends on the machine queue configs and, more importantly, on the real-time machine usage by others.

golaz commented 3 years ago

@chengzhuzhang and @czender: I'll gladly defer to you two regarding the best default settings for post-processing with ncclimo.

Here is an example of a configuration file for some v1-HR output on Chrysalis. Note that MPAS-Analysis will fail because some mpas-o output files are corrupted, so you may want to turn that task off.

[default]
input = /lcrc/group/e3sm/ac.ndkeen/scratch/chrys/E3SM_simulations/production-unc06
input_subdir = run
output = /lcrc/group/e3sm/ac.golaz/E3SM_analysis/20210112.A_WCYCL1950S_CMIP6_HR.ne120_oRRS18v3_ICG.unc06
case = 20210112.A_WCYCL1950S_CMIP6_HR.ne120_oRRS18v3_ICG.unc06
www = /lcrc/group/e3sm/public_html/diagnostic_output/ac.golaz/E3SM/v1-HR
partition = compute
e3sm_unified = latest

[climo]
active = True
years = "137-155",

  [[ atm_monthly_720x1440_nco ]]
  input_files = "cam.h0"
  mapping_file = /home/ac.zender/data/maps/map_ne120np4_to_cmip6_720x1440_nco.20190601.nc

[ts]
active = True
years = "137:155:1",

  [[ atm_monthly_720x1440_nco ]]
  input_files = "cam.h0"
  mapping_file = /home/ac.zender/data/maps/map_ne120np4_to_cmip6_720x1440_nco.20190601.nc
  vars = "FSNTOA,FLUT,FSNT,FLNT,FSNS,FLNS,SHFLX,QFLX,TAUX,TAUY,PRECC,PRECL,PRECSC,PRECSL,TS,TREFHT,CLDTOT,CLDHGH,CLDMED,CLDLOW"

  [[ atm_monthly_glb ]]
  input_files = "cam.h0"
  mapping_file = "glb"
  vars = "FSNTOA,FLUT,FSNT,FLNT,FSNS,FLNS,SHFLX,QFLX,TAUX,TAUY,PRECC,PRECL,PRECSC,PRECSL,TS,TREFHT,CLDTOT,CLDHGH,CLDMED,CLDLOW"

[glb]
active = False

[e3sm_diags]
active = True
years = "137-155",

  [[ atm_monthly_720x1440_nco ]]
  short_name = 'unc06'
  grid = '720x1440_nco'
  reference_data_path = '/lcrc/soft/climate/e3sm_diags_data/obs_for_e3sm_diags/climatology'
  output_format_subplot = "pdf",

[e3sm_diags_vs_model]
active = False

[amwg]
active = False

[mpas_analysis]
active = True
walltime = "24:00:00"
parallelTaskCount = 6
subdir_ocean = run
subdir_ice = run
ts_years = "137-155",
enso_years = "137-155",
climo_years = "137-155",
mesh = "oRRS18to6v3"