Open jacobbieker opened 1 year ago
The HuggingFace datasets script will be hosted here https://huggingface.co/datasets/openclimatefix/arco-era5
Hi, is this issue available to work on?
Yes, I believe so, I can assign you it?
Yes, that sounds good!
Detailed Description
The Google ERA5 archive is available: https://github.com/google-research/ARCO-ERA5 and goes back quite far. Its currently being expanded, but already covers years of data at the native model resolution. This is probably quicker and faster to pull data from than HuggingFace, and is already on Google Public Datasets, so should be easy to integrate.
Context
Its a lot of data in an accessible and publicly available place that we can try to leverage for training the graph models easily.
Possible Implementation
A HuggingFace dataset script like for the EUMETSAT data? Or can just add Xarray opening it in this repo directly as well.