google-research / arco-era5

Recipes for reproducing Analysis-Ready & Cloud Optimized (ARCO) ERA5 datasets.
https://cloud.google.com/storage/docs/public-datasets/era5
Apache License 2.0
287 stars, 22 forks

Missing v_component_of_wind and zarr metadata #76

Closed: markmbaum closed this issue 2 months ago

markmbaum commented 2 months ago

I noticed that the dataset at https://console.cloud.google.com/storage/browser/gcp-public-data-arco-era5/ar/1959-2023_01_10-full_37-1h-1440x721.zarr is missing the v_component_of_wind variable. It's also missing the usual zarr metadata files (.zattrs, .zgroup, .zmetadata) and can't be opened in the usual way. For example,

import xarray as xr

xr.open_zarr("gs://gcp-public-data-arco-era5/ar/1959-2023_01_10-full_37-1h-1440x721.zarr")

raises FileNotFoundError because it can't find those metadata files.
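For anyone wanting to confirm which of those files are absent before calling open_zarr, here is a minimal sketch. The helper name `missing_root_metadata` and the example listing are hypothetical, made up for illustration; the function only inspects a list of object names relative to the store root and does not talk to GCS itself.

```python
from typing import Iterable, List

# Root-level metadata files mentioned above (.zmetadata is the consolidated copy).
ROOT_METADATA_KEYS = (".zgroup", ".zattrs", ".zmetadata")


def missing_root_metadata(keys: Iterable[str]) -> List[str]:
    """Return the root metadata files that do not appear in `keys`.

    `keys` are object names relative to the store root, e.g. ".zattrs" or
    "temperature/.zarray". Per-variable files do not count as root metadata.
    """
    present = {key.strip("/") for key in keys}
    return [name for name in ROOT_METADATA_KEYS if name not in present]


# Hypothetical listing of a partially written store: arrays only, no root metadata.
listing = [
    "temperature/.zarray",
    "temperature/0.0.0.0",
    "u_component_of_wind/.zarray",
]
print(missing_root_metadata(listing))
```

The listing itself could come from any tool that enumerates the bucket prefix, as long as the names are made relative to the .zarr root before being passed in.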

shoyer commented 2 months ago

Is there a reason you're trying to access this particular data? It is not described in the README. I think this is a partially written test version of data that should probably be deleted.

The full version of this dataset is in: gs://gcp-public-data-arco-era5/ar/1959-2023_01_10-full_37-1h-0p25deg-chunk-1.zarr
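As a rough sketch of checking that path for the variable in question, something like the following should work. It assumes xarray and gcsfs are installed and that the bucket allows anonymous reads; the function name `has_variable` is hypothetical, and nothing is fetched until the function is called.

```python
FULL_DATASET = (
    "gs://gcp-public-data-arco-era5/ar/"
    "1959-2023_01_10-full_37-1h-0p25deg-chunk-1.zarr"
)


def has_variable(path: str = FULL_DATASET, name: str = "v_component_of_wind") -> bool:
    """Open the store lazily and report whether `name` is a data variable."""
    # Imported here so the module loads even without xarray installed.
    import xarray as xr

    # Assumption: anonymous access is enough for this public bucket.
    ds = xr.open_zarr(path, chunks=None, storage_options={"token": "anon"})
    return name in ds.data_vars


# Usage (requires network access):
# print(has_variable())
```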

markmbaum commented 2 months ago

No, I don't need this particular dataset. I was just looking around, noticed it, and thought I would mention it.