Open alexandrefierro opened 8 months ago
Unfortunately, the dataset would be too large to include in a repo, since we need daily zonal and meridional wind fields at both 250 and 850 hPa (it's possible, but it would be on the order of a few GB for 5 years). Did you try the provided script to download the ERA5 data? You can restrict it to 5 years in the namelist.
Good afternoon Jonathan:
Yes, I am familiar with this (modified) ERA5 download script and restricted the retrieval to the last 5-6 years, but the data transfer still takes quite a bit of time to complete on our Linux cluster (despite the very useful pooling tool you use). Many of the individual transfers are interrupted with "WARNING HTTP error: [500 Internal Server Error]. Attempt 1 of 500." before resuming; hence my query.
Thank you.
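For anyone hitting the same intermittent 500 errors, one possible workaround is to wrap each individual request in a retry loop with a growing pause between attempts. The snippet below is only a minimal sketch: `retry_download`, its parameters, and the idea of passing the download as a callable are illustrative assumptions, not part of the repo's download script or of `cdsapi`.

```python
import time


def retry_download(download, max_attempts=5, base_delay=1.0, sleep=time.sleep):
    """Call download() until it succeeds or max_attempts is reached.

    `download` is any zero-argument callable (hypothetical here); the pause
    doubles after each failed attempt: base_delay, 2*base_delay, ...
    """
    for attempt in range(1, max_attempts + 1):
        try:
            return download()
        except Exception as exc:
            if attempt == max_attempts:
                # Out of attempts: let the caller see the last error.
                raise
            delay = base_delay * 2 ** (attempt - 1)
            print(f"Attempt {attempt} of {max_attempts} failed ({exc}); "
                  f"retrying in {delay:g} s")
            sleep(delay)


# Possible usage with cdsapi (dataset, request and target are placeholders):
#   client = cdsapi.Client()
#   retry_download(lambda: client.retrieve(dataset, request, target))
```

The `sleep` argument is there only so the pause can be replaced when testing; in normal use the default `time.sleep` applies.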
Perhaps the ERA5 server was inundated at the time of your request -- in my experience, running the download script has been very quick. I could look into providing a low-resolution version of the datasets, but I'm sorry I don't have a better answer right now!
The CDS is being upgraded, and they note that some disruptions are possible. I've found that some periods are very slow.
I've experienced the same issue as of today (Oct. 4, 2024). Some files do not download successfully; then, when you go to run, dask throws a segmentation fault because the files can't be found.
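A quick pre-flight check for missing or empty files before running might help catch this earlier than the dask crash does. This is a rough sketch under stated assumptions: the function name `find_missing_files` and the file-name pattern `era5_{var}{level}_{year}.nc` are made up for illustration and would need to be adapted to whatever names the download script actually writes.

```python
from pathlib import Path


def find_missing_files(data_dir, years, variables=("u", "v"),
                       levels=(250, 850),
                       pattern="era5_{var}{level}_{year}.nc"):
    """Return the expected files that are absent or zero-length.

    The naming pattern is a hypothetical placeholder, not the repo's
    actual convention.
    """
    missing = []
    for year in years:
        for var in variables:
            for level in levels:
                path = Path(data_dir) / pattern.format(
                    var=var, level=level, year=year)
                # A zero-byte file usually means an interrupted transfer.
                if not path.is_file() or path.stat().st_size == 0:
                    missing.append(path)
    return missing


if __name__ == "__main__":
    # Example: check five years in a local directory before running.
    for path in find_missing_files("data", range(2019, 2024)):
        print(f"missing or empty: {path}")
```

Re-requesting only the files this reports should be cheaper than rerunning the whole retrieval.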
I'll consider adding some test fields in a Zenodo repository in the future!
Greetings Jonathan:
Does your repo contain all the necessary input ERA5 data for the last ~5-10 years so that users can run a quick test? The rationale for asking is that requesting these files via the standard c = cdsapi.Client() is extremely slow and often fails with connection errors (likely because the data reside on tape systems). The ECMWF MARS server is no better. The ERA5 files used are generally quite small, so adding them as a standalone tarball in this repo or on a separate SFTP server would be very useful.
Cheers and thanks,