remi-kazeroni commented 1 year ago

This issue documents the round of recipe testing performed using the Core release candidate v2.8.0rc2.

Release process

System and settings

`conda`/`mamba`

(base) mamba --version
mamba 1.3.1
conda 23.1.0

Git branches and state

Tue 21 Mar 13:02:47 CET 2023

(base) :~/ESMValTool 
$ git status
On branch main
Your branch is up to date with 'origin/main'.

nothing to commit, working tree clean

(base) :~/ESMValCore 
$ git status
On branch v2.8.x
Your branch is up to date with 'origin/v2.8.x'.

nothing to commit, working tree clean

Installation and environment

$ cd ~/ESMValTool
$ mamba env create -n tool_280rc2 -f environment.yml
$ conda activate tool_280rc2
$ pip install --editable '.[develop]'
$ cd ~/ESMValCore
$ pip install --editable '.[develop]'

Config user file

Main options: all default except search_esgf: when_missing

```yaml
output_dir: ./esmvaltool_output
max_parallel_tasks: 8
log_level: debug
exit_on_warning: false
output_file_type: png
remove_preproc_dir: true
compress_netcdf: false
save_intermediary_cubes: false
config_developer_file: null
profile_diagnostic: false

# Site-specific entries: DKRZ-Levante
search_esgf: when_missing
download_dir: /work/bd0854/DATA/ESMValTool2/download
auxiliary_data_dir: /work/bd0854/DATA/ESMValTool2/AUX
rootpath:
  CMIP6: /work/bd0854/DATA/ESMValTool2/CMIP6_DKRZ
  CMIP5: /work/bd0854/DATA/ESMValTool2/CMIP5_DKRZ
  CMIP3: /work/bd0854/DATA/ESMValTool2/CMIP3
  CORDEX: /work/ik1017/C3SCORDEX/data/c3s-cordex/output
  OBS: /work/bd0854/DATA/ESMValTool2/OBS
  OBS6: /work/bd0854/DATA/ESMValTool2/OBS
  obs4MIPs: /work/bd0854/DATA/ESMValTool2/OBS
  ana4mips: /work/bd0854/DATA/ESMValTool2/OBS
  native6: /work/bd0854/DATA/ESMValTool2/RAWOBS
  RAWOBS: /work/bd0854/DATA/ESMValTool2/RAWOBS
drs:
  CMIP6: DKRZ
  CMIP5: DKRZ
  CMIP3: DKRZ
  CORDEX: BADC
  obs4MIPs: default
  ana4mips: default
  OBS: default
  OBS6: default
  native6: default
```

ESMValTool version

$ esmvaltool version
ESMValCore: 2.8.0rc2
ESMValTool: 2.8.0.dev111+g6faf263f6

Environment file

tool_280rc2.txt

Compute resources used

I used the newly added generate.py script. I modified it so that the release manager can run all 150 recipes in one go by running `python generate.py`, and I adjusted the SLURM settings for all "complicated" recipes. I will open a PR shortly to provide more details on that.

On DKRZ-Levante

When possible (small to medium jobs, in practice 4 jobs per node):
```
#SBATCH --partition=interactive
#SBATCH --mem=64G
```
Large jobs (nodes not shared)
```
#SBATCH --partition=compute
```
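As a rough illustration of the two job classes above, the following sketch builds the corresponding `#SBATCH` header for one recipe. It is not the code in generate.py: the function name `sbatch_header` and the `large` flag are hypothetical, and only the partition and memory values are taken from the settings listed above.

```python
def sbatch_header(recipe: str, large: bool) -> str:
    """Return a hypothetical SLURM header for one recipe job.

    Assumption: recipes are split by hand into "large" jobs and
    small-to-medium jobs, as described in the settings above.
    """
    lines = ["#!/bin/bash", f"#SBATCH --job-name={recipe}"]
    if large:
        # Large jobs run on the compute partition (nodes not shared).
        lines.append("#SBATCH --partition=compute")
    else:
        # Small to medium jobs share a node on the interactive partition.
        lines.append("#SBATCH --partition=interactive")
        lines.append("#SBATCH --mem=64G")
    return "\n".join(lines)


if __name__ == "__main__":
    print(sbatch_header("recipe_python", large=False))
```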

Note: this is the second and final round of testing for v2.8.0. I will publish the overview website and output of the comparison tool in this issue very soon. And then I will tag the community to check the output. Stay tuned!

remi-kazeroni commented 1 year ago

Overview of the results

Numbers of successes and failures

This round of recipe testing produced:

148 successful runs!! (i.e. where an index.html file was generated)
2 failed runs:

Recipe failures

| Recipe | Problem | Related issue/PR |
| --- | --- | --- |
| recipe_autoassess_landsurface_soilmoisture | known missing climatology files (non-public) | marked as broken in https://github.com/ESMValGroup/ESMValTool/issues/3103 |
| recipe_check_obs | known derivation issue for ERA5 | https://github.com/ESMValGroup/ESMValCore/issues/1388 |

For comparison, we released ESMValTool 2.7.0 with 4 non-working recipes (this could have been 5 if we had used a stricter policy on missing data, as done for this round of testing).
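The success criterion used above (a run counts as successful if an index.html file was generated) can be checked mechanically. The sketch below is only an illustration, not the script used for this round of testing: the function name `classify_runs` is hypothetical, and it assumes that index.html sits at the top level of each run directory.

```python
from pathlib import Path


def classify_runs(output_dir):
    """Split run directories into successes and failures.

    Assumption: a run is successful if <run_dir>/index.html exists.
    """
    successes, failures = [], []
    for run_dir in sorted(Path(output_dir).iterdir()):
        if not run_dir.is_dir():
            continue  # ignore stray files next to the run directories
        if (run_dir / "index.html").is_file():
            successes.append(run_dir.name)
        else:
            failures.append(run_dir.name)
    return successes, failures
```

Calling `classify_runs("esmvaltool_output")` would return two lists of run directory names, which can then be counted or posted in an overview like the one above.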

Overview webpage and path to data

Webpage: https://esmvaltool.dkrz.de/shared/esmvaltool/v2.8.0rc2/
Path to the runs on Levante (with preproc dirs for failed runs): /work/bd0854/b309192/recipe_testing/recipe_testing_v2p8/v08rc2/scripts/esmvaltool_output

Note: I will soon make a new post with a markdown list so that contributors can tick boxes after checking the output of their favourite recipes. After that, I'll tag the community.

And thanks very much to everyone who helped testing, fixing, maintaining recipes in the previous round of testing! It is very enjoyable to get results like this with v2.8.0rc2.

remi-kazeroni commented 1 year ago

Hi @ESMValGroup/esmvaltool-developmentteam and @ESMValGroup/esmvaltool-recipe-maintainers, the results from the second and last round of recipe testing for the release of ESMValTool and ESMValCore v2.8 are now available. I would be very grateful if you could take a look at the output of your favourite recipes (see list below) and tick the boxes if the output looks good to you. If that is not the case, please report the issue by editing the list below or posting in this issue.

Deadline: Tuesday, March 28, noon (GMT). The release of ESMValTool v2.8 is scheduled for that day.

Link to previous runs with v2.7.0: https://esmvaltool.dkrz.de/shared/esmvaltool/v2.7.0/debug.html
Link to current runs with v2.8.0rc2: https://esmvaltool.dkrz.de/shared/esmvaltool/v2.8.0rc2/

Some guidelines on how to inspect runs:

- look at plots from the current run vs the previous release run: most of them will be identical, but if Matplotlib has changed some plotting feature, images may look slightly different, so the comparison script may report them if the difference is larger than the threshold - but Mark I eyeball inspection will show they are identical
- other plots will differ because the diagnostic developers updated the plot settings (different colours, axes, etc.): if they look similar enough, then it's fine
- report (and subsequently open issues) if you notice major differences in plots
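To make the idea of a difference threshold concrete, here is a minimal sketch of a threshold-based image comparison. The metric used by the actual comparison tool is not described in this thread, so the root-mean-square pixel difference, the function names and the default threshold below are all assumptions chosen for illustration.

```python
import math


def rms_difference(img_a, img_b):
    """Root-mean-square difference between two grayscale images.

    Images are lists of rows of pixel values and must have the same shape.
    """
    same_shape = len(img_a) == len(img_b) and all(
        len(row_a) == len(row_b) for row_a, row_b in zip(img_a, img_b)
    )
    if not same_shape:
        raise ValueError("images must have the same shape")
    squared = [
        (a - b) ** 2
        for row_a, row_b in zip(img_a, img_b)
        for a, b in zip(row_a, row_b)
    ]
    if not squared:
        return 0.0  # two empty images are trivially identical
    return math.sqrt(sum(squared) / len(squared))


def needs_inspection(img_a, img_b, threshold=1.0):
    """Flag a plot for human inspection if it differs by more than threshold."""
    return rms_difference(img_a, img_b) > threshold
```

With a metric like this, a small rendering change caused by a new Matplotlib version can push a plot over the threshold even though the two images look identical to a human, which is why flagged plots still need a visual check.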

Output comparison between recipes run with Core `v2.8.0rc2` and the previous stable released version `v2.7.0`

Below is the list of 150 recipes currently available in the main branch. The comparison tool returns:

Action required: 120 out of 147 recipe runs need to be inspected by a human.

See complete output in: compare_v280_output.txt

List of recipes to be checked:

[x] recipe_albedolandcover.yml
[x] recipe_anav13jclim.yml
[x] recipe_arctic_ocean.yml
[x] recipe_autoassess_landsurface_permafrost.yml
[ ] recipe_autoassess_landsurface_soilmoisture.yml
[x] recipe_autoassess_landsurface_surfrad.yml
[x] recipe_autoassess_stratosphere.yml
[ ] recipe_capacity_factor.yml
[ ] recipe_carvalhais14nat.yml
[x] recipe_climate_change_hotspot.yml
[ ] recipe_climwip_brunner2019_med.yml
[ ] recipe_climwip_brunner20esd.yml
[ ] recipe_climwip_test_basic.yml
[ ] recipe_climwip_test_performance_sigma.yml
[x] recipe_cmug_h2o.yml
[ ] recipe_collins13ipcc.yml
[ ] recipe_combined_indices.yml
[ ] recipe_consecdrydays.yml
[x] recipe_cox18nature.yml
[ ] recipe_cvdp.yml
[x] recipe_deangelis15nat.yml
[ ] recipe_diurnal_temperature_index.yml
[x] recipe_eady_growth_rate.yml
[x] recipe_ecs.yml
[x] recipe_ecs_constraints.yml
[x] recipe_ecs_scatter.yml
[ ] recipe_ensclus.yml
[x] recipe_esacci_lst.yml
[x] recipe_esacci_oc.yml
[ ] recipe_extreme_events.yml
[ ] recipe_extreme_index.yml
[x] recipe_eyring06jgr.yml
[x] recipe_eyring13jgr_12.yml
[x] recipe_gier2020bg.yml
[ ] recipe_heatwaves_coldwaves.yml
[ ] recipe_hyint.yml
[ ] recipe_hyint_extreme_events.yml
[ ] recipe_impact.yml
[ ] recipe_kcs.yml
[ ] recipe_landcover.yml
[x] recipe_li17natcc.yml
[x] recipe_martin18grl.yml
[x] recipe_meehl20sciadv.yml
[ ] recipe_miles_block.yml
[ ] recipe_miles_eof.yml
[ ] recipe_miles_regimes.yml
[x] recipe_modes_of_variability.yml
[ ] recipe_multimodel_products.yml
[x] recipe_ocean_Landschuetzer2016.yml
[x] recipe_ocean_amoc.yml
[x] recipe_ocean_bgc.yml
[x] recipe_ocean_example.yml
[x] recipe_ocean_ice_extent.yml
[x] recipe_ocean_multimap.yml
[x] recipe_ocean_quadmap.yml
[x] recipe_ocean_scalar_fields.yml
[ ] recipe_perfmetrics_CMIP5.yml
[ ] recipe_perfmetrics_CMIP5_4cds.yml
[ ] recipe_perfmetrics_land_CMIP5.yml
[x] recipe_psyplot.yml
[x] recipe_pv_capacity_factor.yml
[ ] recipe_quantilebias.yml
[ ] recipe_radiation_budget.yml
[ ] recipe_rainfarm.yml
[ ] recipe_runoff_et.yml
[ ] recipe_russell18jgr.yml
[x] recipe_schlund20esd.yml
[x] recipe_sea_surface_salinity.yml
[x] recipe_seaice.yml
[x] recipe_seaice_drift.yml
[x] recipe_seaice_feedback.yml
[ ] recipe_shapeselect.yml
[ ] recipe_smpi.yml
[ ] recipe_smpi_4cds.yml
[ ] recipe_snowalbedo.yml
[x] recipe_spei.yml
[x] recipe_tcr.yml
[x] recipe_tebaldi21esd.yml
[ ] recipe_thermodyn_diagtool.yml
[x] recipe_toymodel.yml
[x] recipe_validation.yml
[x] recipe_validation_CMIP6.yml
[x] recipe_wenzel14jgr.yml
[x] recipe_wenzel16jclim.yml
[x] recipe_wenzel16nat.yml
[ ] recipe_williams09climdyn_CREM.yml
[ ] recipe_zmnam.yml
[x] bock20jgr/recipe_bock20jgr_fig_1-4.yml
[x] bock20jgr/recipe_bock20jgr_fig_6-7.yml
[x] bock20jgr/recipe_bock20jgr_fig_8-10.yml
[x] clouds/recipe_clouds_bias.yml
[x] clouds/recipe_clouds_ipcc.yml
[x] clouds/recipe_lauer13jclim.yml
[x] clouds/recipe_lauer22jclim_fig1_clim.yml
[x] clouds/recipe_lauer22jclim_fig1_clim_amip.yml
[x] clouds/recipe_lauer22jclim_fig2_taylor.yml
[x] clouds/recipe_lauer22jclim_fig2_taylor_amip.yml
[x] clouds/recipe_lauer22jclim_fig3-4_zonal.yml
[x] clouds/recipe_lauer22jclim_fig5_lifrac.yml
[x] clouds/recipe_lauer22jclim_fig6_interannual.yml
[x] clouds/recipe_lauer22jclim_fig7_seas.yml
[x] clouds/recipe_lauer22jclim_fig8_dyn.yml
[x] clouds/recipe_lauer22jclim_fig9-11ab_scatter.yml
[x] clouds/recipe_lauer22jclim_fig9-11c_pdf.yml
[ ] cmorizers/recipe_daily_era5.yml
[ ] cmorizers/recipe_era5-land.yml
[ ] examples/recipe_check_obs.yml
[x] examples/recipe_concatenate_exps.yml
[ ] examples/recipe_correlation.yml
[x] examples/recipe_decadal.yml
[x] examples/recipe_extract_shape.yml
[x] examples/recipe_julia.yml
[x] examples/recipe_my_personal_diagnostic.yml
[x] examples/recipe_ncl.yml
[x] examples/recipe_preprocessor_derive_test.yml
[x] examples/recipe_preprocessor_test.yml
[x] examples/recipe_python.yml
[x] examples/recipe_r.yml
[x] examples/recipe_variable_groups.yml
[ ] hydrology/recipe_globwat.yml
[ ] hydrology/recipe_hydro_forcing.yml
[ ] hydrology/recipe_hype.yml
[ ] hydrology/recipe_lisflood.yml
[ ] hydrology/recipe_marrmot.yml
[ ] hydrology/recipe_pcrglobwb.yml
[ ] hydrology/recipe_wflow.yml
[x] ipccwg1ar5ch9/recipe_flato13ipcc_figure_914.yml
[x] ipccwg1ar5ch9/recipe_flato13ipcc_figure_924.yml
[x] ipccwg1ar5ch9/recipe_flato13ipcc_figure_942.yml
[x] ipccwg1ar5ch9/recipe_flato13ipcc_figure_945a.yml
[x] ipccwg1ar5ch9/recipe_flato13ipcc_figure_96.yml
[x] ipccwg1ar5ch9/recipe_flato13ipcc_figure_98.yml
[x] ipccwg1ar5ch9/recipe_flato13ipcc_figures_926_927.yml
[x] ipccwg1ar5ch9/recipe_flato13ipcc_figures_92_95.yml
[x] ipccwg1ar5ch9/recipe_flato13ipcc_figures_938_941_cmip3.yml
[x] ipccwg1ar5ch9/recipe_flato13ipcc_figures_938_941_cmip6.yml
[x] ipccwg1ar5ch9/recipe_weigel21gmd_figures_13_16.yml
[x] ipccwg1ar6ch3/recipe_ipccwg1ar6ch3_atmosphere.yml
[x] ipccwg1ar6ch3/recipe_ipccwg1ar6ch3_fig_3_19.yml
[x] ipccwg1ar6ch3/recipe_ipccwg1ar6ch3_fig_3_42_a.yml
[x] ipccwg1ar6ch3/recipe_ipccwg1ar6ch3_fig_3_42_b.yml
[x] ipccwg1ar6ch3/recipe_ipccwg1ar6ch3_fig_3_43.yml
[x] ipccwg1ar6ch3/recipe_ipccwg1ar6ch3_fig_3_9.yml
[x] monitor/recipe_monitor.yml
[x] monitor/recipe_monitor_with_refs.yml
[x] mpqb/recipe_mpqb_xch4.yml
[x] schlund20jgr/recipe_schlund20jgr_gpp_abs_rcp85.yml
[x] schlund20jgr/recipe_schlund20jgr_gpp_change_1pct.yml
[x] schlund20jgr/recipe_schlund20jgr_gpp_change_rcp85.yml
[x] testing/recipe_deangelis15nat_fig1_fast.yml
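Progress on a checklist like the one above can be tallied with a few lines of Python. This is a hypothetical helper, not part of ESMValTool; it assumes that each entry starts with `[x]` or `[ ]`, optionally preceded by a markdown bullet.

```python
def checklist_progress(text):
    """Return (checked, total) for a '[x] name' / '[ ] name' checklist."""
    checked = total = 0
    for line in text.splitlines():
        line = line.strip()
        if line.startswith("- "):
            line = line[2:]  # tolerate markdown bullets
        if line.startswith(("[x] ", "[X] ")):
            checked += 1
            total += 1
        elif line.startswith("[ ] "):
            total += 1
    return checked, total


if __name__ == "__main__":
    sample = "[x] recipe_a.yml\n[ ] recipe_b.yml\n[x] recipe_c.yml"
    checked, total = checklist_progress(sample)
    print(f"{checked} of {total} recipes checked")
```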

bouweandela commented 1 year ago

Note that the tool can now find many more files providing supplementary variables (ancillary variables and cell measures), provided that fx_variables is not used in the recipe. This means that calculations done by the preprocessor functions area_statistics, mask_landsea, mask_landseaice, volume_statistics, and weighting_landsea_fraction are more accurate. Numerical differences with previous versions are therefore expected. See Supplementary variables (ancillary variables and cell measures) in the preprocessor documentation for more information.
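A toy example of why such numerical differences appear: an area-weighted mean changes as soon as the cell areas used as weights change. The sketch below is not ESMValCore's implementation of `area_statistics`, and the values and areas are made up purely for illustration.

```python
def weighted_mean(values, weights):
    """Mean of values weighted by weights (e.g. grid cell areas)."""
    if len(values) != len(weights):
        raise ValueError("values and weights must have the same length")
    return sum(v * w for v, w in zip(values, weights)) / sum(weights)


# Made-up temperatures (K) for three grid cells.
temperatures = [280.0, 290.0, 300.0]

# Hypothetical cell areas: one estimated set, one set as it might be
# provided in a supplementary variable file.
estimated_areas = [1.0, 2.0, 1.0]
provided_areas = [1.0, 2.0, 1.5]

mean_estimated = weighted_mean(temperatures, estimated_areas)
mean_provided = weighted_mean(temperatures, provided_areas)
```

The two means differ only because the weights differ, which is the kind of small shift in values one would expect to see in recipes that use the preprocessor functions listed above.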

katjaweigel commented 1 year ago

recipe_deangelis15nat small differences in values, could be caused by the change @bouweandela just mentioned and are small enough to be ignored. But for one type of figure (recipe_deangelis15nat_20230321_121831/plots/deangelisf2ext/deangelisf2ext/ACCESS1-3.png and similar) something went wrong with the canvas size of the figures (the axis labels are longer than the axis and are now partly missing, in V2.7 the canvas was large enough), I should have a look at that.
recipe_li17natcc differences in values, especially CMIP5 - it uses mask_landsea to get global ts only over the sea so this could be the reason
recipe_cmug_h2o: I found a small issue (wrong position of figure frames) but that one I overlooked already during the last release (although I corrected it for another recipe then) - so not related to this release but I should fix it at some point.
recipe_spei small differences, could be related to the change from SPEI.R V1.7 to SPEI.R V1.8 (small enough to be ignored)

valeriupredoi commented 1 year ago

very many thanks @katjaweigel :beer: Anything you'd reckon can't be fixed with a short (in time) PR?

katjaweigel commented 1 year ago

@valeriupredoi I think the frame for recipe_cmug_h2o and I hope the canvas for recipe_deangelis15nat, but for the second I first have to find out how to change it. (Both should be fixed in the ESMValTool diagnostics.)

valeriupredoi commented 1 year ago

godspeed with that @katjaweigel :racehorse:

katjaweigel commented 1 year ago

Unfortunately I cannot reproduce the issue with the figures from recipe_deangelis15nat:

Figure from test run: https://esmvaltool.dkrz.de/shared/esmvaltool/v2.8.0rc2/

ACCESS1-0

Figure from my own test with the new Core, reduced version of the recipe (/work/bd1083/b380216/output/recipe_deangelis15nat_20230323_172923/):

valeriupredoi commented 1 year ago

@katjaweigel have you recreated the environment to pull in all the dependencies the testing environment used?

katjaweigel commented 1 year ago

@valeriupredoi Thanks, you are right: I installed the new environment, but I forgot to turn it on, sorry!

katjaweigel commented 1 year ago

I made an issue (#3132) and a PR (#3133) now to fix the plot issues in recipe_deangelis15nat and recipe_cmug_h2o (both are really small changes).

valeriupredoi commented 1 year ago

@katjaweigel that's brilliant, very many thanks, I'll have a look in a jiffy 🍺

remi-kazeroni commented 1 year ago

> I made an issue (#3132) and a PR (#3133) now to fix the plot issues in recipe_deangelis15nat and recipe_cmug_h2o (both are really small changes).

Thanks for that @katjaweigel. The new runs (and new plots) are available on the same website: https://esmvaltool.dkrz.de/shared/esmvaltool/v2.8.0rc2/

katjaweigel commented 1 year ago

Thanks a lot @remi-kazeroni and @valeriupredoi!

remi-kazeroni commented 1 year ago

Thanks everyone for checking the recipe results, that was very helpful for the release management team 👍 I see that about 2/3 of the recipes were checked and approved which is good enough to proceed with the release of ESMValTool v2.8.0. I'm closing this issue now. Nevertheless, feel free to continue checking recipe output later on and mark those that were checked. If needed, a new issue can be opened to document potential problems noticed later on.

bouweandela commented 1 year ago

Hi @remi-kazeroni, thanks for the nice overview. I noticed that several recipes that are not checked in the list above are listed as OK in the comparison tool output that you posted. Is this on purpose? For example:

..
recipe_combined_indices.yml: OK
..
recipe_consecdrydays.yml: OK
..

remi-kazeroni commented 1 year ago

Hi @bouweandela, I overlooked that and did not put any [x] for the 27 recipes that were reported as unchanged by the comparison tool. I can still do that if you like. My experience is that it would still be better if someone quickly checked the output manually. We have seen problems that went unnoticed from release to release (like masking of 0s), and in such cases the comparison tool would report that results have not changed since the past release...

bouweandela commented 1 year ago

> We have seen problems that went unnoticed from release to release (like masking of 0s)

That sounds like a serious issue with the comparison tool. Is it reported somewhere? The whole point of having a comparison tool is that you can rely on things being OK if it says they are OK.

remi-kazeroni commented 1 year ago

> > We have seen problems that went unnoticed from release to release (like masking of 0s)
>
> That sounds like a serious issue with the comparison tool. Is it reported somewhere? The whole point of having a comparison tool is that you can rely on things being OK if it says they are OK.

This was fixed in https://github.com/ESMValGroup/ESMValCore/pull/1823 and is in the v2.8.0 release. I think the point I'm trying to make is: we do not have a robust mechanism in place to record "known good output" for recipes merged into main. If we compare recipe output affected by unnoticed bugs (like masking of 0s) or if outputs change because of some improvements (e.g. 1609), we would somehow need to record the "known good output" again. As long as this is not in place (maybe one day as part of a recipe test workflow), I would not fully rely on the OK from the comparison tool because there could be some uncertainty in the "known good output". That is why I personally feel it is safer to take a look at the final recipe results for a release. Nevertheless, the comparison tool has been very useful for me in various cases: comparing output between rcs, review of some PRs, ...
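One possible shape for recording "known good output" is a manifest of file checksums that is written once a human has reviewed a run, and compared against later runs. This is only a sketch of the idea under strong assumptions, not an existing ESMValTool feature: the function names are hypothetical, and byte-identical checksums are stricter than what plots or netCDF files with embedded timestamps would need in practice.

```python
import hashlib
from pathlib import Path


def build_manifest(run_dir):
    """Map each file under run_dir (relative POSIX path) to its SHA-256 digest."""
    run_dir = Path(run_dir)
    return {
        path.relative_to(run_dir).as_posix(): hashlib.sha256(
            path.read_bytes()
        ).hexdigest()
        for path in sorted(run_dir.rglob("*"))
        if path.is_file()
    }


def compare_to_known_good(manifest, known_good):
    """Report files that changed, appeared or disappeared vs known_good."""
    common = manifest.keys() & known_good.keys()
    return {
        "changed": sorted(k for k in common if manifest[k] != known_good[k]),
        "added": sorted(manifest.keys() - known_good.keys()),
        "removed": sorted(known_good.keys() - manifest.keys()),
    }
```

The key point is that the baseline is only as trustworthy as the review behind it: if the reviewed run already contained an unnoticed bug, a clean comparison says nothing about correctness, so the manifest would have to be re-recorded after each bugfix or intended improvement.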

bouweandela commented 1 year ago

Would you say that the recipes with a checkmark above are known good output then? It would be good to take this to the tech lead meeting.

remi-kazeroni commented 1 year ago

> Would you say that the recipes with a checkmark above are known good output then? It would be good to take this to the tech lead meeting.

After a release with quite a few important enhancements and bugfixes, I think yes. Known good output would be those with a checkmark. Maybe it is not necessary that all recipe outputs are checked after each release, but just once in a while (once per year?) or if the Tech Lead Team says that there are good reasons (major Core changes) to justify that.

ESMValGroup / ESMValTool

Recipe testing and output comparison for release 2.8.0 - Final Core release candidate rc2 #3127
