openclimatefix / graph_weather

PyTorch implementation of Ryan Keisler's 2022 "Forecasting Global Weather with Graph Neural Networks" paper (https://arxiv.org/abs/2202.07575)
MIT License

Add support for training on Google's ARCO ERA5 #50

Open jacobbieker opened 1 year ago

jacobbieker commented 1 year ago

Detailed Description

The Google ERA5 archive is available at https://github.com/google-research/ARCO-ERA5 and goes back quite far. It's currently being expanded, but already covers years of data at the native model resolution. This is probably faster to pull data from than HuggingFace, and it is already on Google Public Datasets, so it should be easy to integrate.

Context

It's a lot of data in an accessible, publicly available place that we can leverage for training the graph models easily.

Possible Implementation

A HuggingFace dataset script, like the one for the EUMETSAT data? Or we could just open it with Xarray directly in this repo as well.
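For the Xarray option, a minimal sketch might look like the following. It is not an existing part of this repo: the store path is an assumption copied from the ARCO-ERA5 README and may have changed, `open_arco_era5` and `forecast_pairs` are hypothetical helper names, and the 6-hour step is an assumed default for the forecast interval.

```python
from datetime import datetime, timedelta

# Assumption: path as listed in the ARCO-ERA5 README; check that repo for the current one.
ARCO_ERA5_STORE = "gs://gcp-public-data-arco-era5/ar/full_37-1h-0p25deg-chunk-1.zarr-v3"


def open_arco_era5(store=ARCO_ERA5_STORE):
    """Lazily open the Zarr store (hypothetical helper; needs xarray, zarr and gcsfs)."""
    import xarray as xr  # imported here so forecast_pairs works without xarray installed

    # Anonymous access to the public bucket; no data is read until values are requested.
    return xr.open_zarr(store, chunks=None, storage_options={"token": "anon"})


def forecast_pairs(start, end, step_hours=6):
    """Yield (input_time, target_time) pairs, each target one step after its input."""
    step = timedelta(hours=step_hours)
    t = start
    while t + step <= end:
        yield t, t + step
        t += step


# Usage: build the training pairs for a single day without touching the network.
pairs = list(forecast_pairs(datetime(2020, 1, 1), datetime(2020, 1, 2)))
```

Each pair could then be used to index the opened dataset, for example `ds.sel(time=[t_in, t_out])`, so only the requested timesteps are pulled from the bucket.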

jacobbieker commented 1 year ago

The HuggingFace datasets script will be hosted here: https://huggingface.co/datasets/openclimatefix/arco-era5

ishachadalavada commented 5 months ago

Hi, is this issue available to work on?

jacobbieker commented 5 months ago

Yes, I believe so. Shall I assign it to you?

ishachadalavada commented 5 months ago

Yes, that sounds good!