leap-stc / data-management

Collection of code to manually populate the persistent cloud bucket with data
https://catalog.leap.columbia.edu/
Apache License 2.0

New Dataset [eNATL60-TSW-60m] #73

Open auraoupa opened 10 months ago

auraoupa commented 10 months ago

Dataset Name

eNATL60 TSW 60m

Dataset URL

No response

Description

eNATL60-TSW-60m is an extract from a very high-resolution ocean simulation of the North Atlantic performed at MEOM, IGE (France). It is intended to help design machine-learning-based sub-grid parameterizations within the M2LINES framework.

Size

The dataset consists of one year of ~600 MB daily files, for a total of 202 GB of

License

Creative Commons Zero v1.0 Universal

Data Format

NetCDF

Data Format (other)

No response

Access protocol

HTTP(S)

Source File Organization

365 daily files, organized by month in Zenodo records

Example URLs

https://zenodo.org/records/10261274/files/eNATL60-BLBT02_y2009m07d01.1d_TSW_60m.nc
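A minimal sketch of how the daily file URLs for one month could be generated from the example above. It assumes every file in a record follows the same naming pattern as the example URL, with only the date fields changing. `URL_TEMPLATE`, `RECORD_FOR_MONTH`, and `daily_urls` are hypothetical names for illustration; only the July 2009 record ID is known from this issue, so the records for the other months would still need to be filled in.

```python
import calendar

# Pattern inferred from the single example URL (assumption: all files follow it).
URL_TEMPLATE = (
    "https://zenodo.org/records/{record}/files/"
    "eNATL60-BLBT02_y{year:04d}m{month:02d}d{day:02d}.1d_TSW_60m.nc"
)

# Hypothetical lookup from (year, month) to Zenodo record ID.
# Only this entry is taken from the example URL; the rest are unknown here.
RECORD_FOR_MONTH = {
    (2009, 7): "10261274",
}


def daily_urls(year, month):
    """Return one URL per calendar day of the given month."""
    record = RECORD_FOR_MONTH[(year, month)]
    n_days = calendar.monthrange(year, month)[1]  # number of days in the month
    return [
        URL_TEMPLATE.format(record=record, year=year, month=month, day=day)
        for day in range(1, n_days + 1)
    ]
```

Calling `daily_urls(2009, 7)` yields 31 URLs, the first of which matches the example URL above.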

Authorization

None

Transformation / Processing

Concatenation along the time dimension

Target Format

Zarr
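For illustration only, the requested transformation (concatenate the daily NetCDF files along time, write Zarr) could look roughly like the sketch below using plain xarray. This is not the pangeo-forge recipe mentioned in the comments. It assumes the files have already been downloaded locally, that `xarray`, `dask`, and `zarr` are installed, and that the time dimension is named `time_counter`, which is a guess and should be checked against the actual files. `concat_to_zarr` is a hypothetical helper name.

```python
def concat_to_zarr(paths, store, time_dim="time_counter"):
    """Concatenate daily NetCDF files along `time_dim` and write a Zarr store.

    `time_dim` defaults to "time_counter", which is an assumption about
    the files, not something stated in this issue.
    """
    # Imported here so the sketch can be loaded without the optional dependency.
    import xarray as xr

    # Open all files lazily and stack them in the given (sorted) order.
    ds = xr.open_mfdataset(
        sorted(paths),
        combine="nested",
        concat_dim=time_dim,
    )
    # Write the combined dataset, overwriting any existing store at that path.
    ds.to_zarr(store, mode="w")
    return ds
```

Sorting the paths relies on the zero-padded dates in the file names, so lexical order matches chronological order.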

Comments

I wrote the first part of a pangeo-forge recipe here, which maps each month to its Zenodo record name.

auraoupa commented 10 months ago

👋 @jbusecke

jbusecke commented 9 months ago

👋 @auraoupa. Thank you so much for raising this issue. I am working on this over at #75.