WCRP-CORDEX / cordex-cmip6-cv

Controlled Vocabulary (CV) for use in CORDEX
BSD 3-Clause "New" or "Revised" License
1 stars 6 forks source link

required global attributes #7

Closed larsbuntemeyer closed 7 months ago

larsbuntemeyer commented 2 years ago

It's not clear to me what global attributes should be required for CORDEX-CMIP6, e.g., for CMIP6, we have

"required_global_attributes":[
            "Conventions",
        "CORDEX_domain",
            "activity_id",
            "creation_date",
            "data_specs_version",
        "driving_experiment_id",
        "driving_institution_id",
        "driving_source_id",
        "driving_variant_label",
            "experiment_id",
            "forcing_index",
            "frequency",
            "further_info_url",
            "grid",
            "grid_label",
            "initialization_index",
            "institution",
            "institution_id",
            "license",
            "nominal_resolution",
            "physics_index",
            "product",
            "realization_index",
            "realm",
            "source",
            "source_id",
            "source_type",
            "table_id",
            "tracking_id",
            "variable_id",
            "variant_label"
        ],
larsbuntemeyer commented 2 years ago

These are global attributes from CORDEX-CMIP5:

"Conventions", 
"contact",
"creation_date",
"experiment", 
"experiment_id"
"driving_experiment",
"driving_model_id",
"driving_model_ensemble_member",
"driving_experiment_name",
"frequency",
"institution", 
"institute_id",
"model_id",
"rcm_version_id",
"project_id",
"CORDEX_domain",
"product",
"references",
"tracking_id"

There are some obvious attributes required by CORDEX, e.g., CORDEX_domain, driving_*, etc... but there is lots of open question here since the current CORDEX archive specifications are outdated (based on CMIP5 vocabulary), e.g.

Suggestions to change following global attributes

CORDEX-CMIP6 attribute corresponding CORDEX-CMIP5 attribute alternative CORDEX-CMIP6 attribute (using parent vocabulary)
source_id model_id
institution_id institute_id
driving_source_variant_label driving_model_ensemble_member parent_model_variant_label
driving_source_id driving_model_id parent_source_id
driving_experiment_id driving_experiment_name parent_experiment_id
activity_id project_id

See also here: https://docs.google.com/document/d/1h0r8RZr_f3-8egBMMh7aqLwy3snpD6_MrDz1q8n5XUk/edit

larsbuntemeyer commented 2 years ago

There seems to be a problem in trying to avoid the parent_experiment_id attribute which seems to be required by the experiment_id attribute in the CV. In general, if i look at the cmor source code, it seems to handle some attributes in a special way, i should write an extensive issue here...

UPDATE: This issue is solved here: https://github.com/PCMDI/cmor/issues/677

larsbuntemeyer commented 1 year ago

Update (30.01.2023)

Following our discussion, the preliminary set of global attributes:

required

recommended

DRS elements


Additional depedent attributes that give more detailed meta info will be derived and can be filled automatically.

larsbuntemeyer commented 1 year ago

here is the current table:

https://docs.google.com/spreadsheets/d/1xlbakqx3btSzT5Ke_q4GiQuK3Js80qNbQjSP4wUsAlY/edit?usp=sharing

larsbuntemeyer commented 7 months ago

I added grid to required global attributes, so it is up to date with our published specs:

{
    "required_global_attributes": [
        "activity_id",
        "contact",
        "Conventions",
        "creation_date",
        "domain",
        "domain_id",
        "driving_experiment",
        "driving_experiment_id",
        "driving_institution_id",
        "driving_source_id",
        "driving_variant_label",
        "frequency",
        "grid",
        "institution",
        "institution_id",
        "license",
        "mip_era",
        "product",
        "project_id",
        "source",
        "source_id",
        "source_type",
        "tracking_id",
        "variable_id",
        "version_realization"
    ]
}