halo-db / storymap

Story mapping
Creative Commons Zero v1.0 Universal

Access data as N-D labeled datasets #56

Open d70-t opened 3 years ago

d70-t commented 3 years ago

There's a concept of general N-D labeled datasets which is gaining more and more traction within the community. As a user of the datasets from the HALO-DB, I want to access the data as an N-D labeled dataset. The obtained datasets should follow a widely accepted convention (e.g. CF-Conventions).

Note on N-D labeled datasets. The basic idea is to collect a couple of multidimensional arrays into one dataset. The dimensions of the arrays are labeled and can be shared between arrays within a dataset. Datasets and arrays additionally carry attributes which provide more information. Conventions like the CF Conventions specify how a dataset should be interpreted.
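To make the data model above more concrete, here is a minimal sketch in plain Python. It is not the API of any existing library: the `Variable` and `Dataset` classes and the `dim_sizes` helper are hypothetical names invented for illustration, and the example values are made up. The attribute names (`standard_name`, `units`, `Conventions`) follow the style of the CF Conventions.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class Variable:
    """One multidimensional array with labeled dimensions (hypothetical sketch)."""
    dims: tuple          # dimension names, one per array axis
    data: list           # nested lists standing in for an N-D array
    attrs: dict = field(default_factory=dict)

    def shape(self) -> tuple:
        # Walk one level of nesting per dimension to recover the array shape.
        shape, level = [], self.data
        for _ in self.dims:
            shape.append(len(level))
            level = level[0]
        return tuple(shape)


@dataclass
class Dataset:
    """A collection of variables which share dimensions by name (hypothetical sketch)."""
    variables: dict
    attrs: dict = field(default_factory=dict)

    def dim_sizes(self) -> dict:
        # A shared dimension must have the same length in every variable using it.
        sizes = {}
        for name, var in self.variables.items():
            for dim, size in zip(var.dims, var.shape()):
                if sizes.setdefault(dim, size) != size:
                    raise ValueError(f"dimension {dim!r} has conflicting sizes in {name!r}")
        return sizes


# Made-up example values; "time" and "height" are shared between the arrays.
ds = Dataset(
    variables={
        "time": Variable(("time",), [0, 10, 20],
                         {"units": "seconds since 2020-01-01 00:00:00"}),
        "height": Variable(("height",), [100.0, 200.0],
                           {"standard_name": "height", "units": "m"}),
        "temperature": Variable(("time", "height"),
                                [[280.1, 279.4], [280.3, 279.5], [280.0, 279.2]],
                                {"standard_name": "air_temperature", "units": "K"}),
    },
    attrs={"Conventions": "CF-1.8"},
)

print(ds.dim_sizes())
```

The point of the sketch is that dimensions are identified by name rather than by axis position, so the `temperature` array can be related to the `time` and `height` coordinate arrays without any extra bookkeeping, while the attributes carry the information a convention needs in order to interpret the values.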

N-D labeled datasets show up in various forms:

Note that CF-Conventions may be applied equally well to all of those formats. There are small and sometimes subtle differences between the various forms, but most datasets can be converted between those variants without any loss. There is also value in deliberately crafting a dataset so that it can be transformed between these formats, as they serve quite distinct purposes.

Client libraries are particularly important, as a user usually doesn't care which of the storage variants is served. The fact that netCDF plans to support all variants will help in this regard.