NCAR / esm-collection-spec

Earth System Model Collection specification
Apache License 2.0

⚠️ Archived Repository Notice

This repository has been archived and is no longer actively maintained. The code and information contained within this repository may be outdated and may no longer be relevant to current usage.

Development of this project has moved to https://github.com/intake/intake-esm, where the up-to-date spec now resides. Please check that repository for the latest updates and the actively maintained specification.

ESM Catalog Specification


The Earth System Model Catalog specification describes a way of cataloging large datasets with a homogeneous metadata structure, such as those produced by the Coupled Model Intercomparison Project of the World Climate Research Programme. It was designed within the Pangeo project, growing out of various ad hoc attempts at building catalogs of convenience for CMIP6 and related datasets in the months before the 2019 CMIP6 Hackathon.
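To make "homogeneous metadata structure" concrete, here is a minimal sketch of a catalog as a plain table in which every asset is described by the same set of columns. The column names are borrowed from CMIP6 conventions for illustration only; the model names, file paths, and the `load_catalog` and `search` helpers are hypothetical and are not part of the specification or of any library.

```python
import csv
import io

# Hypothetical catalog: one row per data asset, identical columns for every row.
CATALOG_CSV = """\
activity_id,source_id,experiment_id,table_id,variable_id,path
CMIP,MODEL-A,historical,Amon,tas,/data/example/tas_Amon_MODEL-A_historical.nc
CMIP,MODEL-A,historical,Amon,pr,/data/example/pr_Amon_MODEL-A_historical.nc
ScenarioMIP,MODEL-B,ssp585,Amon,tas,/data/example/tas_Amon_MODEL-B_ssp585.nc
"""


def load_catalog(text):
    """Parse the CSV text into a list of dicts keyed by column name."""
    return list(csv.DictReader(io.StringIO(text)))


def search(rows, **query):
    """Return the rows whose columns equal every key/value pair in the query."""
    return [
        row for row in rows
        if all(row.get(column) == value for column, value in query.items())
    ]


rows = load_catalog(CATALOG_CSV)
tas_assets = search(rows, variable_id="tas")
for row in tas_assets:
    print(row["source_id"], row["experiment_id"], row["path"])
```

Because every row shares the same columns, a query is simply a set of column/value constraints, which is what makes catalogs of this shape convenient to filter.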

Background and Related Projects

Standards

via https://xkcd.com/927/

We are guilty of creating a new standard rather than reusing one of the many reasonable alternatives already in existence. Here we try to justify this choice.

With sufficient time, we probably could have adopted any of these alternatives and made it work for our needs. The decision to make a new spec was ultimately driven by the timeline of the CMIP6 hackathon: it seemed like the fastest route.

The Specification

The collection-spec/ directory contains the ESM collection core specification, along with examples and information about the validation schema and validation tool.
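As a rough illustration of what a collection descriptor can look like, the sketch below builds one as a Python dictionary and checks it for a handful of top-level fields. The field names are recalled from the specification and should be verified against the collection-spec/ directory; the values, the `REQUIRED_FIELDS` tuple, and the `missing_fields` helper are assumptions made for this example. This is not the project's validation tool, and the validation schema remains the authoritative reference.

```python
import json

# Hypothetical descriptor. Field names are recalled from the spec and should be
# checked against collection-spec/; all values are placeholders.
collection = {
    "esmcat_version": "0.1.0",
    "id": "example-collection",
    "description": "Illustrative collection, not a real dataset.",
    "catalog_file": "example-catalog.csv",
    "attributes": [
        {"column_name": "source_id"},
        {"column_name": "experiment_id"},
        {"column_name": "variable_id"},
    ],
    "assets": {"column_name": "path", "format": "netcdf"},
}

# Assumed set of required top-level fields (an assumption, not the official schema).
REQUIRED_FIELDS = (
    "esmcat_version",
    "id",
    "description",
    "catalog_file",
    "attributes",
    "assets",
)


def missing_fields(descriptor):
    """Return the assumed-required fields that are absent from the descriptor."""
    return [field for field in REQUIRED_FIELDS if field not in descriptor]


# Round-trip through JSON, since descriptors are stored as JSON files.
text = json.dumps(collection, indent=2)
roundtrip = json.loads(text)
problems = missing_fields(roundtrip)
print("missing fields:", problems)
```

The descriptor only points at the catalog table (`catalog_file`) and names which columns carry metadata and which one locates the asset; the table itself stays in a separate file.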

Intake driver: intake-esm

Intake-esm is a data cataloging utility built on top of intake, pandas, and xarray. Intake-esm provides functionality for: