ai2cm / fv3net

explore the FV3 data for parameterization
MIT License
16 stars 3 forks source link

Cluster-based training of machine learning (ML) models cannot handle the cloud-style path strings in the `catalog.yaml` file. #2219

Closed mrudko closed 1 year ago

mrudko commented 1 year ago

Training machine learning (ML) models require access to the grid information of the domain, wind rotation matrix parameters and land_sea_mask data. The data are stored in .zarr format with the locations to the datasets being specified in the catalog.yaml file within vcm directory. The paths are given in the form of "gs"-strings - the style of the path strings used on the cloud platform. This pull request introduces the code modifications needed for performing ML training on supercomputing systems.

Significant internal changes:

Resolves #2204 (partially)