xarray-contrib / datatree

WIP implementation of a tree-like hierarchical data structure for xarray.
https://xarray-datatree.readthedocs.io
Apache License 2.0
162 stars 43 forks source link

Example datatree for use in tutorial documentation #100

Open TomNicholas opened 2 years ago

TomNicholas commented 2 years ago

What would help me enormously with writing documentation would be a killer example datatree, which I could open and use to demonstrate use of all types of methods. Just like we have the "air_temperature" example dataset used in the main xarray documentation.

To be as useful as possible, this example tree should hit a few criteria:

A really good inspiration is this pseudo-structure provided in https://github.com/pydata/xarray/issues/4118:

image

This would hit all of the criteria above, if it actually existed somewhere I could find!

What I would like is for people who have more familiarity with real geo-science data products to help me make this killer example tree, or at least point me towards data that I might use.

If we have multiple good suggestions I could make multiple different examples to use, but I think I would prefer one really good one to multiple quite good ones. Alternatively any extras could end up getting used for some future example notebooks though.

@jhamman @jbusecke @rabernat @alexamici @joshmoore

joshmoore commented 2 years ago

I defer to @thewtex though I assume that "small" might be an issue :)

TomNicholas commented 1 year ago

@andersy005 @jhamman just a ping to say that if either of you can help me get some data like this I could really push forward with writing cool documentation for datatree!

andersy005 commented 1 year ago

@TomNicholas, i have this on my today's TODO list. I should have a few samples ready by the end of the day.

joshmoore commented 1 year ago

Maybe the community has ideas as well https://stackoverflow.com/questions/73160955/open-root-zarr-with-multiple-groups-using-xarray :smile:

andersy005 commented 1 year ago

@TomNicholas, i now have two datasets (one from CMIP6 and another one from CESM2 Large Ensemble)

CESM2 Large Ensemble ```python DataTree('None', parent=None) ├── DataTree('ocn') │ ├── DataTree('historical') │ │ └── DataTree('monthly') │ │ ├── DataTree('smbb') │ │ │ Dimensions: (member_id: 1, time: 6, z_t: 1, nlat: 384, nlon: 320, d2: 2) │ │ │ Coordinates: │ │ │ * member_id (member_id) │ │ │ * z_t (z_t) float32 500.0 │ │ │ Dimensions without coordinates: nlat, nlon, d2 │ │ │ Data variables: │ │ │ O2 (member_id, time, z_t, nlat, nlon) float32 dask.array │ │ │ Attributes: │ │ │ Conventions: CF-1.0; http://www.cgd.ucar.edu/cms/eaton/netcdf... │ │ │ calendar: All years have exactly 365 days. │ │ │ cell_methods: cell_methods = time: mean ==> the variable value... │ │ │ contents: Diagnostic and Prognostic Variables │ │ │ model_doi_url: https://doi.org/10.5065/D67H1H0V │ │ │ revision: $Id$ │ │ │ source: CCSM POP2, the CCSM Ocean Component │ │ │ time_period_freq: month_1 │ │ │ intake_esm_dataset_key: ocn/historical/monthly/smbb │ │ └── DataTree('cmip6') │ │ Dimensions: (member_id: 1, time: 6, z_t: 1, nlat: 384, nlon: 320, d2: 2) │ │ Coordinates: │ │ * member_id (member_id) │ │ * z_t (z_t) float32 500.0 │ │ Dimensions without coordinates: nlat, nlon, d2 │ │ Data variables: │ │ O2 (member_id, time, z_t, nlat, nlon) float32 dask.array │ │ Attributes: │ │ Conventions: CF-1.0; http://www.cgd.ucar.edu/cms/eaton/netcdf... │ │ calendar: All years have exactly 365 days. │ │ cell_methods: cell_methods = time: mean ==> the variable value... │ │ contents: Diagnostic and Prognostic Variables │ │ model_doi_url: https://doi.org/10.5065/D67H1H0V │ │ revision: $Id$ │ │ source: CCSM POP2, the CCSM Ocean Component │ │ start_time: This dataset was created on 2020-07-18 at 07:26:... │ │ time_period_freq: month_1 │ │ intake_esm_dataset_key: ocn/historical/monthly/cmip6 │ └── DataTree('ssp370') │ └── DataTree('monthly') │ ├── DataTree('smbb') │ │ Dimensions: (member_id: 1, time: 6, z_t: 1, nlat: 384, nlon: 320, d2: 2) │ │ Coordinates: │ │ * member_id (member_id) │ │ * z_t (z_t) float32 500.0 │ │ Dimensions without coordinates: nlat, nlon, d2 │ │ Data variables: │ │ O2 (member_id, time, z_t, nlat, nlon) float32 dask.array │ │ Attributes: │ │ Conventions: CF-1.0; http://www.cgd.ucar.edu/cms/eaton/netcdf... │ │ calendar: All years have exactly 365 days. │ │ cell_methods: cell_methods = time: mean ==> the variable value... │ │ contents: Diagnostic and Prognostic Variables │ │ model_doi_url: https://doi.org/10.5065/D67H1H0V │ │ revision: $Id$ │ │ source: CCSM POP2, the CCSM Ocean Component │ │ time_period_freq: month_1 │ │ intake_esm_dataset_key: ocn/ssp370/monthly/smbb │ └── DataTree('cmip6') │ Dimensions: (member_id: 1, time: 6, z_t: 1, nlat: 384, nlon: 320, d2: 2) │ Coordinates: │ * member_id (member_id) │ * z_t (z_t) float32 500.0 │ Dimensions without coordinates: nlat, nlon, d2 │ Data variables: │ O2 (member_id, time, z_t, nlat, nlon) float32 dask.array │ Attributes: │ Conventions: CF-1.0; http://www.cgd.ucar.edu/cms/eaton/netcdf... │ calendar: All years have exactly 365 days. │ cell_methods: cell_methods = time: mean ==> the variable value... │ contents: Diagnostic and Prognostic Variables │ model_doi_url: https://doi.org/10.5065/D67H1H0V │ revision: $Id$ │ source: CCSM POP2, the CCSM Ocean Component │ time_period_freq: month_1 │ intake_esm_dataset_key: ocn/ssp370/monthly/cmip6 └── DataTree('atm') ├── DataTree('historical') │ └── DataTree('monthly') │ ├── DataTree('cmip6') │ │ Dimensions: (member_id: 1, time: 6, lat: 192, lon: 288, nbnd: 2) │ │ Coordinates: │ │ * lat (lat) float64 -90.0 -89.06 -88.12 -87.17 ... 88.12 89.06 90.0 │ │ * lon (lon) float64 0.0 1.25 2.5 3.75 5.0 ... 355.0 356.2 357.5 358.8 │ │ * member_id (member_id) │ │ Dimensions without coordinates: nbnd │ │ Data variables: │ │ PRECC (member_id, time, lat, lon) float32 dask.array │ │ TREFHT (member_id, time, lat, lon) float32 dask.array │ │ Attributes: │ │ time_period_freq: month_1 │ │ model_doi_url: https://doi.org/10.5065/D67H1H0V │ │ Conventions: CF-1.0 │ │ NCO: netCDF Operators version 4.9.4 (Homepage = http:... │ │ logname: sunseon │ │ topography_file: /mnt/lustre/share/CESM/cesm_input/atm/cam/topo/f... │ │ source: CAM │ │ intake_esm_dataset_key: atm/historical/monthly/cmip6 │ └── DataTree('smbb') │ Dimensions: (member_id: 1, time: 6, lat: 192, lon: 288, nbnd: 2) │ Coordinates: │ * lat (lat) float64 -90.0 -89.06 -88.12 -87.17 ... 88.12 89.06 90.0 │ * lon (lon) float64 0.0 1.25 2.5 3.75 5.0 ... 355.0 356.2 357.5 358.8 │ * member_id (member_id) │ Dimensions without coordinates: nbnd │ Data variables: │ PRECC (member_id, time, lat, lon) float32 dask.array │ TREFHT (member_id, time, lat, lon) float32 dask.array │ Attributes: │ time_period_freq: month_1 │ model_doi_url: https://doi.org/10.5065/D67H1H0V │ Conventions: CF-1.0 │ logname: sunseon │ topography_file: /mnt/lustre/share/CESM/cesm_input/atm/cam/topo/f... │ source: CAM │ intake_esm_dataset_key: atm/historical/monthly/smbb └── DataTree('ssp370') └── DataTree('monthly') ├── DataTree('cmip6') │ Dimensions: (member_id: 1, time: 6, lat: 192, lon: 288, nbnd: 2) │ Coordinates: │ * lat (lat) float64 -90.0 -89.06 -88.12 -87.17 ... 88.12 89.06 90.0 │ * lon (lon) float64 0.0 1.25 2.5 3.75 5.0 ... 355.0 356.2 357.5 358.8 │ * member_id (member_id) │ Dimensions without coordinates: nbnd │ Data variables: │ PRECC (member_id, time, lat, lon) float32 dask.array │ TREFHT (member_id, time, lat, lon) float32 dask.array │ Attributes: │ time_period_freq: month_1 │ model_doi_url: https://doi.org/10.5065/D67H1H0V │ Conventions: CF-1.0 │ host: mom1 │ logname: sunseon │ topography_file: /mnt/lustre/share/CESM/cesm_input/atm/cam/topo/f... │ source: CAM │ intake_esm_dataset_key: atm/ssp370/monthly/cmip6 └── DataTree('smbb') Dimensions: (member_id: 1, time: 6, lat: 192, lon: 288, nbnd: 2) Coordinates: * lat (lat) float64 -90.0 -89.06 -88.12 -87.17 ... 88.12 89.06 90.0 * lon (lon) float64 0.0 1.25 2.5 3.75 5.0 ... 355.0 356.2 357.5 358.8 * member_id (member_id) Dimensions without coordinates: nbnd Data variables: PRECC (member_id, time, lat, lon) float32 dask.array TREFHT (member_id, time, lat, lon) float32 dask.array Attributes: time_period_freq: month_1 model_doi_url: https://doi.org/10.5065/D67H1H0V Conventions: CF-1.0 logname: sunseon topography_file: /mnt/lustre/share/CESM/cesm_input/atm/cam/topo/f... source: CAM intake_esm_dataset_key: atm/ssp370/monthly/smbb ```
CMIP6 ```python DataTree('None', parent=None) ├── DataTree('ScenarioMIP') │ ├── DataTree('MIROC') │ │ └── DataTree('MIROC6') │ │ └── DataTree('ssp370') │ │ ├── DataTree('Lmon') │ │ │ └── DataTree('gn') │ │ │ Dimensions: (lat: 128, bnds: 2, lon: 256, member_id: 1, time: 6) │ │ │ Coordinates: │ │ │ * lat (lat) float64 -88.93 -87.54 -86.14 -84.74 ... 86.14 87.54 88.93 │ │ │ lat_bnds (lat, bnds) float64 ... │ │ │ * lon (lon) float64 0.0 1.406 2.812 4.219 ... 354.4 355.8 357.2 358.6 │ │ │ lon_bnds (lon, bnds) float64 ... │ │ │ * time (time) datetime64[ns] 2015-01-16T12:00:00 ... 2015-06-16 │ │ │ time_bnds (time, bnds) datetime64[ns] ... │ │ │ * member_id (member_id) object 'r1i1p1f1' │ │ │ Dimensions without coordinates: bnds │ │ │ Data variables: │ │ │ mrso (member_id, time, lat, lon) float32 ... │ │ │ Attributes: (12/48) │ │ │ Conventions: CF-1.7 CMIP-6.2 │ │ │ activity_id: ScenarioMIP AerChemMIP │ │ │ branch_method: standard │ │ │ branch_time_in_child: 60265.0 │ │ │ branch_time_in_parent: 60265.0 │ │ │ cmor_version: 3.4.0 │ │ │ ... ... │ │ │ variable_id: mrso │ │ │ variant_label: r1i1p1f1 │ │ │ status: 2019-10-29;created;by nhn2@columbia.edu │ │ │ netcdf_tracking_ids: hdl:21.14100/3ba01dc3-ab7e-45d0-882a-66ed2768a642 │ │ │ version_id: v20190627 │ │ │ intake_esm_dataset_key: ScenarioMIP/MIROC/MIROC6/ssp370/Lmon/gn │ │ ├── DataTree('Amon') │ │ │ └── DataTree('gn') │ │ │ Dimensions: (lat: 128, bnds: 2, lon: 256, member_id: 1, time: 6) │ │ │ Coordinates: │ │ │ * lat (lat) float64 -88.93 -87.54 -86.14 -84.74 ... 86.14 87.54 88.93 │ │ │ lat_bnds (lat, bnds) float64 ... │ │ │ * lon (lon) float64 0.0 1.406 2.812 4.219 ... 354.4 355.8 357.2 358.6 │ │ │ lon_bnds (lon, bnds) float64 ... │ │ │ * time (time) datetime64[ns] 2015-01-16T12:00:00 ... 2015-06-16 │ │ │ time_bnds (time, bnds) datetime64[ns] ... │ │ │ * member_id (member_id) object 'r1i1p1f1' │ │ │ Dimensions without coordinates: bnds │ │ │ Data variables: │ │ │ pr (member_id, time, lat, lon) float32 ... │ │ │ Attributes: (12/48) │ │ │ Conventions: CF-1.7 CMIP-6.2 │ │ │ activity_id: ScenarioMIP AerChemMIP │ │ │ branch_method: standard │ │ │ branch_time_in_child: 60265.0 │ │ │ branch_time_in_parent: 60265.0 │ │ │ cmor_version: 3.4.0 │ │ │ ... ... │ │ │ variable_id: pr │ │ │ variant_label: r1i1p1f1 │ │ │ status: 2019-10-25;created;by nhn2@columbia.edu │ │ │ netcdf_tracking_ids: hdl:21.14100/c23c415d-adca-4e01-8e7c-11617bcfa2bb │ │ │ version_id: v20190627 │ │ │ intake_esm_dataset_key: ScenarioMIP/MIROC/MIROC6/ssp370/Amon/gn │ │ └── DataTree('Omon') │ │ └── DataTree('gn') │ │ Dimensions: (y: 256, x: 360, time: 6, lev: 1, bnds: 2, │ │ member_id: 1, vertices: 4) │ │ Coordinates: (12/13) │ │ latitude (y, x) float32 ... │ │ * lev (lev) float64 1.0 │ │ lev_bnds (lev, bnds) float64 ... │ │ longitude (y, x) float32 ... │ │ sigma_bnds (lev, bnds) float64 ... │ │ * time (time) datetime64[ns] 2015-01-16T12:00:00 ... 2015-06-16 │ │ ... ... │ │ * x (x) float64 0.5 1.5 2.5 3.5 ... 356.5 357.5 358.5 359.5 │ │ x_bnds (x, bnds) float64 ... │ │ * y (y) float64 -88.0 -85.75 -85.25 ... 148.6 150.5 152.4 │ │ y_bnds (y, bnds) float64 ... │ │ zlev_bnds (lev, bnds) float64 ... │ │ * member_id (member_id) object 'r1i1p1f1' │ │ Dimensions without coordinates: bnds, vertices │ │ Data variables: │ │ depth (y, x) float32 ... │ │ depth_c float64 ... │ │ eta (time, y, x) float32 ... │ │ nsigma int32 ... │ │ sigma (lev) float64 ... │ │ thetao (member_id, time, lev, y, x) float32 ... │ │ vertices_latitude (y, x, vertices) float32 ... │ │ vertices_longitude (y, x, vertices) float32 ... │ │ zlev (lev) float64 ... │ │ Attributes: (12/48) │ │ Conventions: CF-1.7 CMIP-6.2 │ │ activity_id: ScenarioMIP AerChemMIP │ │ branch_method: standard │ │ branch_time_in_child: 60265.0 │ │ branch_time_in_parent: 60265.0 │ │ cmor_version: 3.4.0 │ │ ... ... │ │ variable_id: thetao │ │ variant_label: r1i1p1f1 │ │ status: 2019-11-18;created;by nhn2@columbia.edu │ │ netcdf_tracking_ids: hdl:21.14100/99dda520-c9e9-4617-b4ca-0de0a2b9398... │ │ version_id: v20190627 │ │ intake_esm_dataset_key: ScenarioMIP/MIROC/MIROC6/ssp370/Omon/gn │ ├── DataTree('CCCma') │ │ └── DataTree('CanESM5') │ │ └── DataTree('ssp370') │ │ ├── DataTree('Amon') │ │ │ └── DataTree('gn') │ │ │ Dimensions: (lat: 64, bnds: 2, lon: 128, member_id: 1, time: 6) │ │ │ Coordinates: │ │ │ * lat (lat) float64 -87.86 -85.1 -82.31 -79.53 ... 82.31 85.1 87.86 │ │ │ lat_bnds (lat, bnds) float64 ... │ │ │ * lon (lon) float64 0.0 2.812 5.625 8.438 ... 348.8 351.6 354.4 357.2 │ │ │ lon_bnds (lon, bnds) float64 ... │ │ │ * time (time) object 2015-01-16 12:00:00 ... 2015-06-16 00:00:00 │ │ │ time_bnds (time, bnds) object ... │ │ │ * member_id (member_id) object 'r1i1p1f1' │ │ │ Dimensions without coordinates: bnds │ │ │ Data variables: │ │ │ pr (member_id, time, lat, lon) float32 ... │ │ │ Attributes: (12/57) │ │ │ CCCma_model_hash: 1f91f92cb6d607391f44831504025d32fc44faa1 │ │ │ CCCma_parent_runid: rc3.1-his01 │ │ │ CCCma_pycmor_hash: 33c30511acc319a98240633965a04ca99c26427e │ │ │ CCCma_runid: rc3.1-s7001 │ │ │ Conventions: CF-1.7 CMIP-6.2 │ │ │ YMDH_branch_time_in_child: 2015:01:01:00 │ │ │ ... ... │ │ │ tracking_id: hdl:21.14100/8c4a1496-f308-493e-8ecc-a2e253e... │ │ │ variable_id: pr │ │ │ variant_label: r1i1p1f1 │ │ │ version: v20190429 │ │ │ version_id: v20190429 │ │ │ intake_esm_dataset_key: ScenarioMIP/CCCma/CanESM5/ssp370/Amon/gn │ │ ├── DataTree('Lmon') │ │ │ └── DataTree('gn') │ │ │ Dimensions: (member_id: 1, time: 6, lat: 64, lon: 128, bnds: 2) │ │ │ Coordinates: │ │ │ * lat (lat) float64 -87.86 -85.1 -82.31 -79.53 ... 82.31 85.1 87.86 │ │ │ lat_bnds (lat, bnds) float64 ... │ │ │ * lon (lon) float64 0.0 2.812 5.625 8.438 ... 348.8 351.6 354.4 357.2 │ │ │ lon_bnds (lon, bnds) float64 ... │ │ │ * time (time) object 2015-01-16 12:00:00 ... 2015-06-16 00:00:00 │ │ │ time_bnds (time, bnds) object ... │ │ │ * member_id (member_id) object 'r1i1p1f1' │ │ │ Dimensions without coordinates: bnds │ │ │ Data variables: │ │ │ gpp (member_id, time, lat, lon) float32 ... │ │ │ mrso (member_id, time, lat, lon) float32 ... │ │ │ Attributes: (12/53) │ │ │ source: CanESM5 (2019): \naerosol: interactive\natmo... │ │ │ source_type: AOGCM │ │ │ institution: Canadian Centre for Climate Modelling and An... │ │ │ YMDH_branch_time_in_child: 2015:01:01:00 │ │ │ initialization_index: 1 │ │ │ product: model-output │ │ │ ... ... │ │ │ parent_time_units: days since 1850-01-01 0:0:0.0 │ │ │ parent_variant_label: r1i1p1f1 │ │ │ realization_index: 1 │ │ │ parent_activity_id: CMIP │ │ │ branch_time_in_parent: 60225.0 │ │ │ intake_esm_dataset_key: ScenarioMIP/CCCma/CanESM5/ssp370/Lmon/gn │ │ └── DataTree('Omon') │ │ └── DataTree('gn') │ │ Dimensions: (i: 360, j: 291, lev: 1, bnds: 2, member_id: 1, │ │ time: 6, vertices: 4) │ │ Coordinates: │ │ * i (i) int32 0 1 2 3 4 5 6 ... 353 354 355 356 357 358 359 │ │ * j (j) int32 0 1 2 3 4 5 6 ... 284 285 286 287 288 289 290 │ │ latitude (j, i) float64 ... │ │ * lev (lev) float64 3.047 │ │ lev_bnds (lev, bnds) float64 ... │ │ longitude (j, i) float64 ... │ │ * time (time) object 2015-01-16 12:00:00 ... 2015-06-16 00:0... │ │ time_bnds (time, bnds) object ... │ │ * member_id (member_id) object 'r1i1p1f1' │ │ Dimensions without coordinates: bnds, vertices │ │ Data variables: │ │ no3 (member_id, time, lev, j, i) float32 ... │ │ vertices_latitude (j, i, vertices) float64 ... │ │ vertices_longitude (j, i, vertices) float64 ... │ │ thetao (member_id, time, lev, j, i) float32 ... │ │ Attributes: (12/52) │ │ source: CanESM5 (2019): \naerosol: interactive\natmo... │ │ source_type: AOGCM │ │ institution: Canadian Centre for Climate Modelling and An... │ │ YMDH_branch_time_in_child: 2015:01:01:00 │ │ initialization_index: 1 │ │ product: model-output │ │ ... ... │ │ parent_time_units: days since 1850-01-01 0:0:0.0 │ │ parent_variant_label: r1i1p1f1 │ │ realization_index: 1 │ │ parent_activity_id: CMIP │ │ branch_time_in_parent: 60225.0 │ │ intake_esm_dataset_key: ScenarioMIP/CCCma/CanESM5/ssp370/Omon/gn │ └── DataTree('NCAR') │ └── DataTree('CESM2-WACCM') │ └── DataTree('ssp370') │ ├── DataTree('Lmon') │ │ └── DataTree('gn') │ │ Dimensions: (lat: 192, lon: 288, time: 6, member_id: 1, hist_interval: 2) │ │ Coordinates: │ │ * lat (lat) float64 -90.0 -89.06 -88.12 -87.17 ... 88.12 89.06 90.0 │ │ * lon (lon) float64 0.0 1.25 2.5 3.75 5.0 ... 355.0 356.2 357.5 358.8 │ │ * time (time) object 2015-01-15 11:45:00 ... 2015-05-15 12:00:00 │ │ * member_id (member_id) object 'r1i1p1f1' │ │ lat_bnds (lat, hist_interval) float32 ... │ │ lon_bnds (lon, hist_interval) float32 ... │ │ time_bnds (time, hist_interval) object ... │ │ Dimensions without coordinates: hist_interval │ │ Data variables: │ │ gpp (member_id, time, lat, lon) float32 ... │ │ mrso (member_id, time, lat, lon) float32 ... │ │ Attributes: (12/45) │ │ source: CESM2 (2017): atmosphere: CAM6 (0.9x1.25 finite ... │ │ source_type: AOGCM BGC CHEM AER │ │ institution: National Center for Atmospheric Research, Climat... │ │ initialization_index: 1 │ │ product: model-output │ │ branch_method: standard │ │ ... ... │ │ parent_time_units: days since 0001-01-01 00:00:00 │ │ parent_variant_label: r1i1p1f1 │ │ realization_index: 1 │ │ parent_activity_id: CMIP │ │ branch_time_in_parent: 735110.0 │ │ intake_esm_dataset_key: ScenarioMIP/NCAR/CESM2-WACCM/ssp370/Lmon/gn │ ├── DataTree('Amon') │ │ └── DataTree('gn') │ │ Dimensions: (member_id: 1, time: 6, plev: 1, lat: 192, lon: 288, nbnd: 2) │ │ Coordinates: │ │ * lat (lat) float64 -90.0 -89.06 -88.12 -87.17 ... 88.12 89.06 90.0 │ │ lat_bnds (lat, nbnd) float64 ... │ │ * lon (lon) float64 0.0 1.25 2.5 3.75 5.0 ... 355.0 356.2 357.5 358.8 │ │ lon_bnds (lon, nbnd) float64 ... │ │ * plev (plev) float64 1e+05 │ │ * time (time) object 2015-01-15 12:00:00 ... 2015-06-15 00:00:00 │ │ time_bnds (time, nbnd) object ... │ │ * member_id (member_id) object 'r1i1p1f1' │ │ Dimensions without coordinates: nbnd │ │ Data variables: │ │ co2 (member_id, time, plev, lat, lon) float32 ... │ │ pr (member_id, time, lat, lon) float32 ... │ │ Attributes: (12/45) │ │ source: CESM2 (2017): atmosphere: CAM6 (0.9x1.25 finite ... │ │ source_type: AOGCM BGC CHEM AER │ │ institution: National Center for Atmospheric Research, Climat... │ │ initialization_index: 1 │ │ product: model-output │ │ branch_method: standard │ │ ... ... │ │ parent_time_units: days since 0001-01-01 00:00:00 │ │ parent_variant_label: r1i1p1f1 │ │ realization_index: 1 │ │ parent_activity_id: CMIP │ │ branch_time_in_parent: 735110.0 │ │ intake_esm_dataset_key: ScenarioMIP/NCAR/CESM2-WACCM/ssp370/Amon/gn │ └── DataTree('Omon') │ └── DataTree('gn') │ Dimensions: (nlat: 384, nlon: 320, vertices: 4, lev: 1, d2: 2, member_id: 1, │ time: 6) │ Coordinates: │ lat (nlat, nlon) float64 ... │ lat_bnds (nlat, nlon, vertices) float32 ... │ * lev (lev) float64 500.0 │ lev_bnds (lev, d2) float32 ... │ lon (nlat, nlon) float64 ... │ lon_bnds (nlat, nlon, vertices) float32 ... │ * nlat (nlat) int32 1 2 3 4 5 6 7 8 ... 377 378 379 380 381 382 383 384 │ * nlon (nlon) int32 1 2 3 4 5 6 7 8 ... 313 314 315 316 317 318 319 320 │ * time (time) object 2015-01-15 13:00:00.000007 ... 2015-06-15 00:00:00 │ time_bnds (time, d2) object ... │ * member_id (member_id) object 'r1i1p1f1' │ Dimensions without coordinates: vertices, d2 │ Data variables: │ no3 (member_id, time, lev, nlat, nlon) float32 ... │ thetao (member_id, time, lev, nlat, nlon) float32 ... │ Attributes: (12/44) │ source: CESM2 (2017): atmosphere: CAM6 (0.9x1.25 finite ... │ source_type: AOGCM BGC CHEM AER │ institution: National Center for Atmospheric Research, Climat... │ initialization_index: 1 │ product: model-output │ branch_method: standard │ ... ... │ parent_time_units: days since 0001-01-01 00:00:00 │ parent_variant_label: r1i1p1f1 │ realization_index: 1 │ parent_activity_id: CMIP │ branch_time_in_parent: 735110.0 │ intake_esm_dataset_key: ScenarioMIP/NCAR/CESM2-WACCM/ssp370/Omon/gn └── DataTree('CMIP') ├── DataTree('MIROC') │ └── DataTree('MIROC6') │ ├── DataTree('piControl') │ │ ├── DataTree('Amon') │ │ │ └── DataTree('gn') │ │ │ Dimensions: (lat: 128, bnds: 2, lon: 256, member_id: 1, time: 6) │ │ │ Coordinates: │ │ │ * lat (lat) float64 -88.93 -87.54 -86.14 -84.74 ... 86.14 87.54 88.93 │ │ │ lat_bnds (lat, bnds) float64 ... │ │ │ * lon (lon) float64 0.0 1.406 2.812 4.219 ... 354.4 355.8 357.2 358.6 │ │ │ lon_bnds (lon, bnds) float64 ... │ │ │ * time (time) object 3200-01-16 12:00:00 ... 3200-06-16 00:00:00 │ │ │ time_bnds (time, bnds) object ... │ │ │ * member_id (member_id) object 'r1i1p1f1' │ │ │ Dimensions without coordinates: bnds │ │ │ Data variables: │ │ │ pr (member_id, time, lat, lon) float32 ... │ │ │ Attributes: (12/48) │ │ │ Conventions: CF-1.7 CMIP-6.2 │ │ │ activity_id: CMIP │ │ │ branch_method: standard │ │ │ branch_time_in_child: 0.0 │ │ │ branch_time_in_parent: 365242.0 │ │ │ cmor_version: 3.3.2 │ │ │ ... ... │ │ │ variable_id: pr │ │ │ variant_label: r1i1p1f1 │ │ │ status: 2019-10-25;created;by nhn2@columbia.edu │ │ │ netcdf_tracking_ids: hdl:21.14100/9a510d79-f7ac-4cb4-9b47-6b6ec73ceb4... │ │ │ version_id: v20181212 │ │ │ intake_esm_dataset_key: CMIP/MIROC/MIROC6/piControl/Amon/gn │ │ ├── DataTree('Lmon') │ │ │ └── DataTree('gn') │ │ │ Dimensions: (lat: 128, bnds: 2, lon: 256, member_id: 1, time: 6) │ │ │ Coordinates: │ │ │ * lat (lat) float64 -88.93 -87.54 -86.14 -84.74 ... 86.14 87.54 88.93 │ │ │ lat_bnds (lat, bnds) float64 ... │ │ │ * lon (lon) float64 0.0 1.406 2.812 4.219 ... 354.4 355.8 357.2 358.6 │ │ │ lon_bnds (lon, bnds) float64 ... │ │ │ * time (time) object 3200-01-16 12:00:00 ... 3200-06-16 00:00:00 │ │ │ time_bnds (time, bnds) object ... │ │ │ * member_id (member_id) object 'r1i1p1f1' │ │ │ Dimensions without coordinates: bnds │ │ │ Data variables: │ │ │ mrso (member_id, time, lat, lon) float32 ... │ │ │ Attributes: (12/48) │ │ │ Conventions: CF-1.7 CMIP-6.2 │ │ │ activity_id: CMIP │ │ │ branch_method: standard │ │ │ branch_time_in_child: 0.0 │ │ │ branch_time_in_parent: 365242.0 │ │ │ cmor_version: 3.3.2 │ │ │ ... ... │ │ │ variable_id: mrso │ │ │ variant_label: r1i1p1f1 │ │ │ status: 2019-10-26;created;by nhn2@columbia.edu │ │ │ netcdf_tracking_ids: hdl:21.14100/ce661632-766a-45e6-9f98-8caa7391d24... │ │ │ version_id: v20190311 │ │ │ intake_esm_dataset_key: CMIP/MIROC/MIROC6/piControl/Lmon/gn │ │ └── DataTree('Omon') │ │ └── DataTree('gn') │ │ Dimensions: (y: 256, x: 360, time: 6, lev: 1, bnds: 2, │ │ member_id: 1, vertices: 4) │ │ Coordinates: (12/13) │ │ latitude (y, x) float32 ... │ │ * lev (lev) float64 1.0 │ │ lev_bnds (lev, bnds) float64 ... │ │ longitude (y, x) float32 ... │ │ sigma_bnds (lev, bnds) float64 ... │ │ * time (time) object 3200-01-16 12:00:00 ... 3200-06-16 00:0... │ │ ... ... │ │ * x (x) float64 0.5 1.5 2.5 3.5 ... 356.5 357.5 358.5 359.5 │ │ x_bnds (x, bnds) float64 ... │ │ * y (y) float64 -88.0 -85.75 -85.25 ... 148.6 150.5 152.4 │ │ y_bnds (y, bnds) float64 ... │ │ zlev_bnds (lev, bnds) float64 ... │ │ * member_id (member_id) object 'r1i1p1f1' │ │ Dimensions without coordinates: bnds, vertices │ │ Data variables: │ │ depth (y, x) float32 ... │ │ depth_c float64 ... │ │ eta (time, y, x) float32 ... │ │ nsigma int32 ... │ │ sigma (lev) float64 ... │ │ thetao (member_id, time, lev, y, x) float32 ... │ │ vertices_latitude (y, x, vertices) float32 ... │ │ vertices_longitude (y, x, vertices) float32 ... │ │ zlev (lev) float64 ... │ │ Attributes: (12/48) │ │ Conventions: CF-1.7 CMIP-6.2 │ │ activity_id: CMIP │ │ branch_method: standard │ │ branch_time_in_child: 0.0 │ │ branch_time_in_parent: 365242.0 │ │ cmor_version: 3.3.2 │ │ ... ... │ │ variable_id: thetao │ │ variant_label: r1i1p1f1 │ │ status: 2019-10-25;created;by nhn2@columbia.edu │ │ netcdf_tracking_ids: hdl:21.14100/3d093cd5-61c4-4cb6-824e-d776d71e535... │ │ version_id: v20190311 │ │ intake_esm_dataset_key: CMIP/MIROC/MIROC6/piControl/Omon/gn │ └── DataTree('historical') │ ├── DataTree('Amon') │ │ └── DataTree('gn') │ │ Dimensions: (lat: 128, bnds: 2, lon: 256, member_id: 1, time: 6) │ │ Coordinates: │ │ * lat (lat) float64 -88.93 -87.54 -86.14 -84.74 ... 86.14 87.54 88.93 │ │ lat_bnds (lat, bnds) float64 ... │ │ * lon (lon) float64 0.0 1.406 2.812 4.219 ... 354.4 355.8 357.2 358.6 │ │ lon_bnds (lon, bnds) float64 ... │ │ * time (time) datetime64[ns] 1850-01-16T12:00:00 ... 1850-06-16 │ │ time_bnds (time, bnds) datetime64[ns] ... │ │ * member_id (member_id) object 'r1i1p1f1' │ │ Dimensions without coordinates: bnds │ │ Data variables: │ │ pr (member_id, time, lat, lon) float32 ... │ │ Attributes: (12/48) │ │ Conventions: CF-1.7 CMIP-6.2 │ │ activity_id: CMIP │ │ branch_method: standard │ │ branch_time_in_child: 0.0 │ │ branch_time_in_parent: 0.0 │ │ cmor_version: 3.3.2 │ │ ... ... │ │ variable_id: pr │ │ variant_label: r1i1p1f1 │ │ status: 2019-10-25;created;by nhn2@columbia.edu │ │ netcdf_tracking_ids: hdl:21.14100/61fa8b6b-e74c-4e86-9344-8ba946ee8a8... │ │ version_id: v20181212 │ │ intake_esm_dataset_key: CMIP/MIROC/MIROC6/historical/Amon/gn │ ├── DataTree('Lmon') │ │ └── DataTree('gn') │ │ Dimensions: (lat: 128, bnds: 2, lon: 256, member_id: 1, time: 6) │ │ Coordinates: │ │ * lat (lat) float64 -88.93 -87.54 -86.14 -84.74 ... 86.14 87.54 88.93 │ │ lat_bnds (lat, bnds) float64 ... │ │ * lon (lon) float64 0.0 1.406 2.812 4.219 ... 354.4 355.8 357.2 358.6 │ │ lon_bnds (lon, bnds) float64 ... │ │ * time (time) datetime64[ns] 1850-01-16T12:00:00 ... 1850-06-16 │ │ time_bnds (time, bnds) datetime64[ns] ... │ │ * member_id (member_id) object 'r1i1p1f1' │ │ Dimensions without coordinates: bnds │ │ Data variables: │ │ mrso (member_id, time, lat, lon) float32 ... │ │ Attributes: (12/48) │ │ Conventions: CF-1.7 CMIP-6.2 │ │ activity_id: CMIP │ │ branch_method: standard │ │ branch_time_in_child: 0.0 │ │ branch_time_in_parent: 0.0 │ │ cmor_version: 3.3.2 │ │ ... ... │ │ variable_id: mrso │ │ variant_label: r1i1p1f1 │ │ status: 2019-10-25;created;by nhn2@columbia.edu │ │ netcdf_tracking_ids: hdl:21.14100/a702781b-b6d9-4f90-a65d-c649d59a224... │ │ version_id: v20190311 │ │ intake_esm_dataset_key: CMIP/MIROC/MIROC6/historical/Lmon/gn │ └── DataTree('Omon') │ └── DataTree('gn') │ Dimensions: (y: 256, x: 360, time: 6, lev: 1, bnds: 2, │ member_id: 1, vertices: 4) │ Coordinates: (12/13) │ latitude (y, x) float32 ... │ * lev (lev) float64 1.0 │ lev_bnds (lev, bnds) float64 ... │ longitude (y, x) float32 ... │ sigma_bnds (lev, bnds) float64 ... │ * time (time) datetime64[ns] 1850-01-16T12:00:00 ... 1850-06-16 │ ... ... │ * x (x) float64 0.5 1.5 2.5 3.5 ... 356.5 357.5 358.5 359.5 │ x_bnds (x, bnds) float64 ... │ * y (y) float64 -88.0 -85.75 -85.25 ... 148.6 150.5 152.4 │ y_bnds (y, bnds) float64 ... │ zlev_bnds (lev, bnds) float64 ... │ * member_id (member_id) object 'r1i1p1f1' │ Dimensions without coordinates: bnds, vertices │ Data variables: │ depth (y, x) float32 ... │ depth_c float64 ... │ eta (time, y, x) float32 ... │ nsigma int32 ... │ sigma (lev) float64 ... │ thetao (member_id, time, lev, y, x) float32 ... │ vertices_latitude (y, x, vertices) float32 ... │ vertices_longitude (y, x, vertices) float32 ... │ zlev (lev) float64 ... │ Attributes: (12/48) │ Conventions: CF-1.7 CMIP-6.2 │ activity_id: CMIP │ branch_method: standard │ branch_time_in_child: 0.0 │ branch_time_in_parent: 0.0 │ cmor_version: 3.3.2 │ ... ... │ variable_id: thetao │ variant_label: r1i1p1f1 │ status: 2019-11-08;created;by nhn2@columbia.edu │ netcdf_tracking_ids: hdl:21.14100/16598b35-19b4-49e3-98de-27b9e9444ad... │ version_id: v20190311 │ intake_esm_dataset_key: CMIP/MIROC/MIROC6/historical/Omon/gn ├── DataTree('CCCma') │ └── DataTree('CanESM5') │ ├── DataTree('piControl') │ │ ├── DataTree('Amon') │ │ │ └── DataTree('gn') │ │ │ Dimensions: (lat: 64, bnds: 2, lon: 128, member_id: 1, time: 6) │ │ │ Coordinates: │ │ │ * lat (lat) float64 -87.86 -85.1 -82.31 -79.53 ... 82.31 85.1 87.86 │ │ │ lat_bnds (lat, bnds) float64 ... │ │ │ * lon (lon) float64 0.0 2.812 5.625 8.438 ... 348.8 351.6 354.4 357.2 │ │ │ lon_bnds (lon, bnds) float64 ... │ │ │ * time (time) object 5201-01-16 12:00:00 ... 5201-06-16 00:00:00 │ │ │ time_bnds (time, bnds) object ... │ │ │ * member_id (member_id) object 'r1i1p1f1' │ │ │ Dimensions without coordinates: bnds │ │ │ Data variables: │ │ │ pr (member_id, time, lat, lon) float32 ... │ │ │ Attributes: (12/57) │ │ │ CCCma_model_hash: 24718c8346665b218729640ffe79d263b76456c4 │ │ │ CCCma_parent_runid: rc3-pictrl │ │ │ CCCma_pycmor_hash: 33c30511acc319a98240633965a04ca99c26427e │ │ │ CCCma_runid: rc3.1-pictrl │ │ │ Conventions: CF-1.7 CMIP-6.2 │ │ │ YMDH_branch_time_in_child: 5201:01:01:00 │ │ │ ... ... │ │ │ variant_label: r1i1p1f1 │ │ │ version: v20190429 │ │ │ status: 2019-10-25;created;by nhn2@columbia.edu │ │ │ netcdf_tracking_ids: hdl:21.14100/e32e34a8-13cf-45b5-ba68-3d6b584... │ │ │ version_id: v20190429 │ │ │ intake_esm_dataset_key: CMIP/CCCma/CanESM5/piControl/Amon/gn │ │ ├── DataTree('Lmon') │ │ │ └── DataTree('gn') │ │ │ Dimensions: (member_id: 1, time: 6, lat: 64, lon: 128, bnds: 2) │ │ │ Coordinates: │ │ │ * lat (lat) float64 -87.86 -85.1 -82.31 -79.53 ... 82.31 85.1 87.86 │ │ │ lat_bnds (lat, bnds) float64 ... │ │ │ * lon (lon) float64 0.0 2.812 5.625 8.438 ... 348.8 351.6 354.4 357.2 │ │ │ lon_bnds (lon, bnds) float64 ... │ │ │ * time (time) object 5201-01-16 12:00:00 ... 5201-06-16 00:00:00 │ │ │ time_bnds (time, bnds) object ... │ │ │ * member_id (member_id) object 'r1i1p1f1' │ │ │ Dimensions without coordinates: bnds │ │ │ Data variables: │ │ │ gpp (member_id, time, lat, lon) float32 ... │ │ │ mrso (member_id, time, lat, lon) float32 ... │ │ │ Attributes: (12/47) │ │ │ source_type: AOGCM │ │ │ YMDH_branch_time_in_child: 5201:01:01:00 │ │ │ initialization_index: 1 │ │ │ product: model-output │ │ │ branch_method: Spin-up documentation │ │ │ mip_era: CMIP6 │ │ │ ... ... │ │ │ parent_time_units: days since 1850-01-01 0:0:0.0 │ │ │ parent_variant_label: r1i1p1f1 │ │ │ realization_index: 1 │ │ │ parent_activity_id: CMIP │ │ │ branch_time_in_parent: 1223115.0 │ │ │ intake_esm_dataset_key: CMIP/CCCma/CanESM5/piControl/Lmon/gn │ │ └── DataTree('Omon') │ │ └── DataTree('gn') │ │ Dimensions: (i: 360, j: 291, lev: 1, bnds: 2, member_id: 1, │ │ time: 6, vertices: 4) │ │ Coordinates: │ │ * i (i) int32 0 1 2 3 4 5 6 ... 353 354 355 356 357 358 359 │ │ * j (j) int32 0 1 2 3 4 5 6 ... 284 285 286 287 288 289 290 │ │ latitude (j, i) float64 ... │ │ * lev (lev) float64 3.047 │ │ lev_bnds (lev, bnds) float64 ... │ │ longitude (j, i) float64 ... │ │ * time (time) object 5201-01-16 12:00:00 ... 5201-06-16 00:0... │ │ time_bnds (time, bnds) object ... │ │ * member_id (member_id) object 'r1i1p1f1' │ │ Dimensions without coordinates: bnds, vertices │ │ Data variables: │ │ no3 (member_id, time, lev, j, i) float32 ... │ │ vertices_latitude (j, i, vertices) float64 ... │ │ vertices_longitude (j, i, vertices) float64 ... │ │ thetao (member_id, time, lev, j, i) float32 ... │ │ Attributes: (12/52) │ │ source: CanESM5 (2019): \naerosol: interactive\natmo... │ │ source_type: AOGCM │ │ institution: Canadian Centre for Climate Modelling and An... │ │ YMDH_branch_time_in_child: 5201:01:01:00 │ │ initialization_index: 1 │ │ product: model-output │ │ ... ... │ │ parent_time_units: days since 1850-01-01 0:0:0.0 │ │ parent_variant_label: r1i1p1f1 │ │ realization_index: 1 │ │ parent_activity_id: CMIP │ │ branch_time_in_parent: 1223115.0 │ │ intake_esm_dataset_key: CMIP/CCCma/CanESM5/piControl/Omon/gn │ └── DataTree('historical') │ ├── DataTree('Amon') │ │ └── DataTree('gn') │ │ Dimensions: (lat: 64, bnds: 2, lon: 128, member_id: 1, time: 6) │ │ Coordinates: │ │ * lat (lat) float64 -87.86 -85.1 -82.31 -79.53 ... 82.31 85.1 87.86 │ │ lat_bnds (lat, bnds) float64 ... │ │ * lon (lon) float64 0.0 2.812 5.625 8.438 ... 348.8 351.6 354.4 357.2 │ │ lon_bnds (lon, bnds) float64 ... │ │ * time (time) object 1850-01-16 12:00:00 ... 1850-06-16 00:00:00 │ │ time_bnds (time, bnds) object ... │ │ * member_id (member_id) object 'r1i1p1f1' │ │ Dimensions without coordinates: bnds │ │ Data variables: │ │ pr (member_id, time, lat, lon) float32 ... │ │ Attributes: (12/57) │ │ CCCma_model_hash: 3dedf95315d603326fde4f5340dc0519d80d10c0 │ │ CCCma_parent_runid: rc3-pictrl │ │ CCCma_pycmor_hash: 33c30511acc319a98240633965a04ca99c26427e │ │ CCCma_runid: rc3.1-his01 │ │ Conventions: CF-1.7 CMIP-6.2 │ │ YMDH_branch_time_in_child: 1850:01:01:00 │ │ ... ... │ │ variant_label: r1i1p1f1 │ │ version: v20190429 │ │ status: 2019-10-25;created;by nhn2@columbia.edu │ │ netcdf_tracking_ids: hdl:21.14100/363e1ebe-46e7-43dc-9feb-a7a4a0c... │ │ version_id: v20190429 │ │ intake_esm_dataset_key: CMIP/CCCma/CanESM5/historical/Amon/gn │ ├── DataTree('Lmon') │ │ └── DataTree('gn') │ │ Dimensions: (member_id: 1, time: 6, lat: 64, lon: 128, bnds: 2) │ │ Coordinates: │ │ * lat (lat) float64 -87.86 -85.1 -82.31 -79.53 ... 82.31 85.1 87.86 │ │ lat_bnds (lat, bnds) float64 ... │ │ * lon (lon) float64 0.0 2.812 5.625 8.438 ... 348.8 351.6 354.4 357.2 │ │ lon_bnds (lon, bnds) float64 ... │ │ * time (time) object 1850-01-16 12:00:00 ... 1850-06-16 00:00:00 │ │ time_bnds (time, bnds) object ... │ │ * member_id (member_id) object 'r1i1p1f1' │ │ Dimensions without coordinates: bnds │ │ Data variables: │ │ gpp (member_id, time, lat, lon) float32 ... │ │ mrso (member_id, time, lat, lon) float32 ... │ │ Attributes: (12/53) │ │ source: CanESM5 (2019): \naerosol: interactive\natmo... │ │ source_type: AOGCM │ │ institution: Canadian Centre for Climate Modelling and An... │ │ YMDH_branch_time_in_child: 1850:01:01:00 │ │ initialization_index: 1 │ │ product: model-output │ │ ... ... │ │ parent_time_units: days since 1850-01-01 0:0:0.0 │ │ parent_variant_label: r1i1p1f1 │ │ realization_index: 1 │ │ parent_activity_id: CMIP │ │ branch_time_in_parent: 1223115.0 │ │ intake_esm_dataset_key: CMIP/CCCma/CanESM5/historical/Lmon/gn │ └── DataTree('Omon') │ └── DataTree('gn') │ Dimensions: (i: 360, j: 291, lev: 1, bnds: 2, member_id: 1, │ time: 6, vertices: 4) │ Coordinates: │ * i (i) int32 0 1 2 3 4 5 6 ... 353 354 355 356 357 358 359 │ * j (j) int32 0 1 2 3 4 5 6 ... 284 285 286 287 288 289 290 │ latitude (j, i) float64 ... │ * lev (lev) float64 3.047 │ lev_bnds (lev, bnds) float64 ... │ longitude (j, i) float64 ... │ * time (time) object 1850-01-16 12:00:00 ... 1850-06-16 00:0... │ time_bnds (time, bnds) object ... │ * member_id (member_id) object 'r1i1p1f1' │ Dimensions without coordinates: bnds, vertices │ Data variables: │ no3 (member_id, time, lev, j, i) float32 ... │ vertices_latitude (j, i, vertices) float64 ... │ vertices_longitude (j, i, vertices) float64 ... │ thetao (member_id, time, lev, j, i) float32 ... │ Attributes: (12/52) │ source: CanESM5 (2019): \naerosol: interactive\natmo... │ source_type: AOGCM │ institution: Canadian Centre for Climate Modelling and An... │ YMDH_branch_time_in_child: 1850:01:01:00 │ initialization_index: 1 │ product: model-output │ ... ... │ parent_time_units: days since 1850-01-01 0:0:0.0 │ parent_variant_label: r1i1p1f1 │ realization_index: 1 │ parent_activity_id: CMIP │ branch_time_in_parent: 1223115.0 │ intake_esm_dataset_key: CMIP/CCCma/CanESM5/historical/Omon/gn └── DataTree('NCAR') └── DataTree('CESM2-WACCM') ├── DataTree('historical') │ ├── DataTree('Amon') │ │ └── DataTree('gn') │ │ Dimensions: (member_id: 1, time: 6, plev: 1, lat: 192, lon: 288, nbnd: 2) │ │ Coordinates: │ │ * lat (lat) float64 -90.0 -89.06 -88.12 -87.17 ... 88.12 89.06 90.0 │ │ lat_bnds (lat, nbnd) float64 ... │ │ * lon (lon) float64 0.0 1.25 2.5 3.75 5.0 ... 355.0 356.2 357.5 358.8 │ │ lon_bnds (lon, nbnd) float64 ... │ │ * plev (plev) float64 1e+05 │ │ * time (time) object 1850-01-15 12:00:00 ... 1850-06-15 00:00:00 │ │ time_bnds (time, nbnd) object ... │ │ * member_id (member_id) object 'r1i1p1f1' │ │ Dimensions without coordinates: nbnd │ │ Data variables: │ │ co2 (member_id, time, plev, lat, lon) float32 ... │ │ pr (member_id, time, lat, lon) float32 ... │ │ Attributes: (12/46) │ │ source: CESM2 (2017): atmosphere: CAM6 (0.9x1.25 finite ... │ │ source_type: AOGCM BGC CHEM AER │ │ institution: National Center for Atmospheric Research, Climat... │ │ initialization_index: 1 │ │ product: model-output │ │ branch_method: standard │ │ ... ... │ │ parent_time_units: days since 0001-01-01 00:00:00 │ │ parent_variant_label: r1i1p1f1 │ │ realization_index: 1 │ │ parent_activity_id: CMIP │ │ branch_time_in_parent: 20075.0 │ │ intake_esm_dataset_key: CMIP/NCAR/CESM2-WACCM/historical/Amon/gn │ ├── DataTree('Lmon') │ │ └── DataTree('gn') │ │ Dimensions: (member_id: 1, time: 6, lat: 192, lon: 288, hist_interval: 2) │ │ Coordinates: │ │ * lat (lat) float64 -90.0 -89.06 -88.12 -87.17 ... 88.12 89.06 90.0 │ │ lat_bnds (lat, hist_interval) float32 ... │ │ * lon (lon) float64 0.0 1.25 2.5 3.75 5.0 ... 355.0 356.2 357.5 358.8 │ │ lon_bnds (lon, hist_interval) float32 ... │ │ * time (time) object 1850-01-15 11:45:00.000013 ... 1850-06-15 00:00:00 │ │ time_bnds (time, hist_interval) object ... │ │ * member_id (member_id) object 'r1i1p1f1' │ │ Dimensions without coordinates: hist_interval │ │ Data variables: │ │ gpp (member_id, time, lat, lon) float32 ... │ │ mrso (member_id, time, lat, lon) float32 ... │ │ Attributes: (12/46) │ │ source: CESM2 (2017): atmosphere: CAM6 (0.9x1.25 finite ... │ │ source_type: AOGCM BGC CHEM AER │ │ institution: National Center for Atmospheric Research, Climat... │ │ initialization_index: 1 │ │ product: model-output │ │ branch_method: standard │ │ ... ... │ │ parent_time_units: days since 0001-01-01 00:00:00 │ │ parent_variant_label: r1i1p1f1 │ │ realization_index: 1 │ │ parent_activity_id: CMIP │ │ branch_time_in_parent: 20075.0 │ │ intake_esm_dataset_key: CMIP/NCAR/CESM2-WACCM/historical/Lmon/gn │ └── DataTree('Omon') │ └── DataTree('gn') │ Dimensions: (nlat: 384, nlon: 320, vertices: 4, lev: 1, d2: 2, member_id: 1, │ time: 6) │ Coordinates: │ lat (nlat, nlon) float64 ... │ lat_bnds (nlat, nlon, vertices) float32 ... │ * lev (lev) float64 500.0 │ lev_bnds (lev, d2) float32 ... │ lon (nlat, nlon) float64 ... │ lon_bnds (nlat, nlon, vertices) float32 ... │ * nlat (nlat) int32 1 2 3 4 5 6 7 8 ... 377 378 379 380 381 382 383 384 │ * nlon (nlon) int32 1 2 3 4 5 6 7 8 ... 313 314 315 316 317 318 319 320 │ * time (time) object 1850-01-15 13:00:00 ... 1850-06-15 00:00:00 │ time_bnds (time, d2) object ... │ * member_id (member_id) object 'r1i1p1f1' │ Dimensions without coordinates: vertices, d2 │ Data variables: │ no3 (member_id, time, lev, nlat, nlon) float32 ... │ thetao (member_id, time, lev, nlat, nlon) float32 ... │ Attributes: (12/45) │ source: CESM2 (2017): atmosphere: CAM6 (0.9x1.25 finite ... │ source_type: AOGCM BGC CHEM AER │ institution: National Center for Atmospheric Research, Climat... │ initialization_index: 1 │ product: model-output │ branch_method: standard │ ... ... │ parent_time_units: days since 0001-01-01 00:00:00 │ parent_variant_label: r1i1p1f1 │ realization_index: 1 │ parent_activity_id: CMIP │ branch_time_in_parent: 20075.0 │ intake_esm_dataset_key: CMIP/NCAR/CESM2-WACCM/historical/Omon/gn └── DataTree('piControl') ├── DataTree('Lmon') │ └── DataTree('gn') │ Dimensions: (lat: 192, lon: 288, time: 6, member_id: 1, hist_interval: 2) │ Coordinates: │ * lat (lat) float64 -90.0 -89.06 -88.12 -87.17 ... 88.12 89.06 90.0 │ * lon (lon) float64 0.0 1.25 2.5 3.75 5.0 ... 355.0 356.2 357.5 358.8 │ * time (time) object 0001-01-15 11:44:59.999999 ... 0001-03-15 12:00:... │ * member_id (member_id) object 'r1i1p1f1' │ lat_bnds (lat, hist_interval) float32 ... │ lon_bnds (lon, hist_interval) float32 ... │ time_bnds (time, hist_interval) object ... │ Dimensions without coordinates: hist_interval │ Data variables: │ gpp (member_id, time, lat, lon) float32 ... │ mrso (member_id, time, lat, lon) float32 ... │ Attributes: (12/45) │ source: CESM2 (2017): atmosphere: CAM6 (0.9x1.25 finite ... │ source_type: AOGCM BGC CHEM AER │ institution: National Center for Atmospheric Research, Climat... │ initialization_index: 1 │ product: model-output │ branch_method: standard │ ... ... │ parent_time_units: days since 0001-01-01 00:00:00 │ parent_variant_label: r1i1p1f1 │ realization_index: 1 │ parent_activity_id: CMIP │ branch_time_in_parent: 48545.0 │ intake_esm_dataset_key: CMIP/NCAR/CESM2-WACCM/piControl/Lmon/gn ├── DataTree('Amon') │ └── DataTree('gn') │ Dimensions: (member_id: 1, time: 6, plev: 1, lat: 192, lon: 288, nbnd: 2) │ Coordinates: │ * lat (lat) float64 -90.0 -89.06 -88.12 -87.17 ... 88.12 89.06 90.0 │ lat_bnds (lat, nbnd) float64 ... │ * lon (lon) float64 0.0 1.25 2.5 3.75 5.0 ... 355.0 356.2 357.5 358.8 │ lon_bnds (lon, nbnd) float64 ... │ * plev (plev) float64 1e+05 │ * time (time) object 0001-01-15 12:00:00 ... 0001-06-15 00:00:00 │ time_bnds (time, nbnd) object ... │ * member_id (member_id) object 'r1i1p1f1' │ Dimensions without coordinates: nbnd │ Data variables: │ co2 (member_id, time, plev, lat, lon) float32 ... │ pr (member_id, time, lat, lon) float32 ... │ Attributes: (12/47) │ source: CESM2 (2017): atmosphere: CAM6 (0.9x1.25 finite ... │ source_type: AOGCM BGC CHEM AER │ institution: National Center for Atmospheric Research, Climat... │ initialization_index: 1 │ product: model-output │ branch_method: standard │ ... ... │ parent_time_units: days since 0001-01-01 00:00:00 │ parent_variant_label: r1i1p1f1 │ realization_index: 1 │ parent_activity_id: CMIP │ branch_time_in_parent: 48545.0 │ intake_esm_dataset_key: CMIP/NCAR/CESM2-WACCM/piControl/Amon/gn └── DataTree('Omon') └── DataTree('gn') Dimensions: (nlat: 384, nlon: 320, vertices: 4, lev: 1, d2: 2, member_id: 1, time: 6) Coordinates: lat (nlat, nlon) float64 ... lat_bnds (nlat, nlon, vertices) float32 ... * lev (lev) float64 500.0 lev_bnds (lev, d2) float32 ... lon (nlat, nlon) float64 ... lon_bnds (nlat, nlon, vertices) float32 ... * nlat (nlat) int32 1 2 3 4 5 6 7 8 ... 377 378 379 380 381 382 383 384 * nlon (nlon) int32 1 2 3 4 5 6 7 8 ... 313 314 315 316 317 318 319 320 * time (time) object 0001-01-15 13:00:00.999998 ... 0001-06-15 00:00:... time_bnds (time, d2) object ... * member_id (member_id) object 'r1i1p1f1' Dimensions without coordinates: vertices, d2 Data variables: no3 (member_id, time, lev, nlat, nlon) float32 ... thetao (member_id, time, lev, nlat, nlon) float32 ... Attributes: (12/44) source: CESM2 (2017): atmosphere: CAM6 (0.9x1.25 finite ... source_type: AOGCM BGC CHEM AER institution: National Center for Atmospheric Research, Climat... initialization_index: 1 product: model-output branch_method: standard ... ... parent_time_units: days since 0001-01-01 00:00:00 parent_variant_label: r1i1p1f1 realization_index: 1 parent_activity_id: CMIP branch_time_in_parent: 48545.0 intake_esm_dataset_key: CMIP/NCAR/CESM2-WACCM/piControl/Omon/gn ```
❯ ls -ltrh datasets
total 141M
-rw-rw-r-- 1 andersy005 andersy005  22M Aug  3 12:08 cesm2-lens-sample.nc
-rw-rw-r-- 1 andersy005 andersy005 119M Aug  3 12:59 cmip6-sample.nc

Do you prefer to have these hosted in a cloud storage bucket? We (at CarbonPlan) might be able to host these (Cc @jhamman)

andersy005 commented 1 year ago

@andersy005 @jhamman just a ping to say that if either of you can help me get some data like this I could really push forward with writing cool documentation for datatree!

Ccing @mgrover1 / @scottyhq who may have some ideas about additional radar / satellite datasets we could use

mgrover1 commented 1 year ago

Here is a demo notebook using datatree with radar data - plenty of other examples in that repo too.

https://github.com/mgrover1/pyart-xarray-sandbox/blob/main/notebooks/test-hierachal-multivolume.ipynb

Another example dataset might be a grib file with fields on different levels (isobaric pressure, surface, height above ground)

mgrover1 commented 1 year ago

I would be willing to help write some of the docs for the precious two use cases!