A spinup acceleration tool for land surface model (LSM) family of ORCHIDEE.
Concept: The proposed machine-learning (ML)-enabled spin-up acceleration procedure (MLA) predicts the steady-state of any land pixel of the full model domain after training on a representative subset of pixels. As the computational efficiency of the current generation of LSMs scales linearly with the number of pixels and years simulated, MLA reduces the computation time quasi-linearly with the number of pixels predicted by ML.
Documentation of aims, concepts, workflows are described in Sun et al.202 [open-source]: https://onlinelibrary.wiley.com/doi/full/10.1111/gcb.16623
The SPINacc package includes:
Here are the steps to launch the different tasks of this repository (and the reproducibility tests associated):
git clone git@github.com:CALIPSO-project/SPINacc.git
(The detail of each tasks of the tool is provided in docs/documentation.txt)
The different tasks are (the number of tasks does not correspond to sequence - YET):
Task 1 [optional]: Evaluates the impact of varying the number of K-means clusters on model performance, setting a default of 4 clusters and producing a ‘dist_all.png’ graph.
Task 2 performs the clustering using a K mean algorithm and saves the information on the location of the selected pixels (files starting with 'ID'). The location of the selected pixel (red) for a given PFT and all pixel with a cover fraction exceeding 'cluster_thres' [defined in varlist.json] (grey) are plotted in the figures 'ClustRes_PFT**.png'. Example of PFT2 is shown here:
Task 3: Creates compressed forcing files for ORCHIDEE, containing data for selected pixels only, aligned on a global pseudo-grid for efficient pixel-level simulations, with file specifications listed in varlist.json.
Task 4 performs the ML training on results from ORCHIDEE simulation using the compressed forcing (production mode: resp-format=compressed) or global forcing (debug mode: resp-format=global), extrapolation to a global grid and writing the state variables into global restart files for ORCHIDEE. In debug mode Task 4 also performs the evaluation of ML training outputs vs real model outputs.
Task 5 [optional]: Visualizes ML performance from Task 3, offering two evaluation modes: global pixel evaluation and leave-one-cross-validation (LOOCV) for training sites, generating plots for various state variables at the PFT level, including comparisons of ML predictions with conventional spinup data.
The configuration file has been updated to include new parameters that control the execution of reproducibility tests for each task. These parameters are:
config[17]: Controls the reproducibility test for Task 1. config[19]: Controls the reproducibility test for Task 2. config[21]: Controls the reproducibility test for Task 3. config[23]: Controls the reproducibility test for Task 4.
For each parameter, setting the value to 1 enables the reproducibility test for the corresponding task, while setting it to 0 disables it.