Open bouweandela opened 2 years ago
yis, let's!
A difficulty is that there are apparently 3 different DRS entries used in this project:
what are the numbers (values in the dict) denoting? Also, I'd imagine source_id is equivalent to model? That'd make it two DRS's - actually one coz we can map source_id to model (somehow)
> what are the numbers (values in the dict) denoting?
I think it is the number of records on ESGF with the same dataset_id that use that DRS. Here is the code to get this info with esgf-pyclient:
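from pyesgf.search import SearchConnection

# Search across all ESGF nodes (distrib=True) via the DKRZ index node.
conn = SearchConnection('https://esgf-data.dkrz.de/esg-search',
                        distrib=True)
# Request counts for the directory_format_template_ facet within CREATE-IP.
ctx = conn.new_context(project='CREATE-IP', facets='directory_format_template_')
dict(ctx.facet_counts)['directory_format_template_']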
> Also, I'd imagine source_id is equivalent to model?
It seems this can be different. For example, the ERA5 dataset is produced using the CY41R2 IFS model, if I understand it correctly.
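To see exactly where the DRS variants differ (for instance `model` versus `source_id`), one option is to pull the facet names out of each template and compare them. The following is only a sketch: it assumes the templates use `%(name)s` placeholders, and the `facet_counts` dict holds made-up templates and counts standing in for the real query result. The helper names `template_facets` and `compare_templates` are hypothetical.

```python
import re

# Hypothetical stand-in for the dict returned by the facet query above.
# These templates and counts are illustrative, not real ESGF output.
facet_counts = {
    "%(root)s/%(project)s/%(institute)s/%(model)s/%(variable)s": 10,
    "%(root)s/%(project)s/%(institute)s/%(source_id)s/%(variable)s": 5,
}

# Assumption: templates use Python-style '%(name)s' placeholders.
PLACEHOLDER = re.compile(r"%\((\w+)\)s")


def template_facets(template):
    """Return the facet names used in a template, in order of appearance."""
    return PLACEHOLDER.findall(template)


def compare_templates(counts):
    """Split facet names into those shared by all templates and the rest."""
    facet_sets = [set(template_facets(t)) for t in counts]
    shared = set.intersection(*facet_sets)
    differing = set.union(*facet_sets) - shared
    return shared, differing


shared, differing = compare_templates(facet_counts)
print(sorted(differing))  # facets that appear in some templates but not all
```

With the sample data above, the differing facets are `model` and `source_id`, which is the kind of mismatch that would need a mapping before the variants could be treated as a single DRS.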
The CREATE-IP project is the follow-up to the ana4MIPs project. I think it would be useful to add support for it in ESMValCore, as this would allow e.g. automatically downloading reanalysis datasets that require no further CMORization.
Example integration of CREATE-IP in ESMValCore
config-developer.yml entry could look something like this?
but may also need to add model?
A difficulty is that there are apparently 3 different DRS entries used in this project:
Data finding