Defra-Data-Science-Centre-of-Excellence / sds-data-model

A common data model for the Spatial Data Science unit
https://defra-data-science-centre-of-excellence.github.io/sds-data-model/
MIT License

Parse Gemini 2.3 metadata #1

Open EFT-Defra opened 2 years ago

EFT-Defra commented 2 years ago

Spatial datasets published by the UK government should have associated metadata that conforms to the UK GEMINI standard. Ultimately, we want to parse that metadata into a Python dictionary that can be attached to an xarray DataArray or Dataset via the attrs property.
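As a rough illustration of that goal, the sketch below reads a few elements from a GEMINI record into a plain dictionary using only the standard library. It assumes the record is encoded as ISO 19139 XML with the usual `gmd` and `gco` namespaces. The `ELEMENT_PATHS` mapping, the `parse_metadata` function, and the sample record are hypothetical names and data made up for this example; they are not part of sds-data-model.

```python
import xml.etree.ElementTree as ET

# Namespaces used by the ISO 19139 XML encoding (assumed for GEMINI records).
NAMESPACES = {
    "gmd": "http://www.isotc211.org/2005/gmd",
    "gco": "http://www.isotc211.org/2005/gco",
}

# Hypothetical mapping from dictionary keys to element paths.
# A real parser would need to cover many more GEMINI elements.
ELEMENT_PATHS = {
    "title": ".//gmd:identificationInfo//gmd:citation//gmd:title/gco:CharacterString",
    "abstract": ".//gmd:identificationInfo//gmd:abstract/gco:CharacterString",
    "lineage": ".//gmd:dataQualityInfo//gmd:lineage//gmd:statement/gco:CharacterString",
}

# Made-up record, trimmed to two elements for illustration.
SAMPLE = """<gmd:MD_Metadata
    xmlns:gmd="http://www.isotc211.org/2005/gmd"
    xmlns:gco="http://www.isotc211.org/2005/gco">
  <gmd:identificationInfo>
    <gmd:MD_DataIdentification>
      <gmd:citation>
        <gmd:CI_Citation>
          <gmd:title>
            <gco:CharacterString>Example dataset</gco:CharacterString>
          </gmd:title>
        </gmd:CI_Citation>
      </gmd:citation>
      <gmd:abstract>
        <gco:CharacterString>An illustrative abstract.</gco:CharacterString>
      </gmd:abstract>
    </gmd:MD_DataIdentification>
  </gmd:identificationInfo>
</gmd:MD_Metadata>"""


def parse_metadata(xml_text):
    """Return a dictionary of the mapped elements found in the record."""
    root = ET.fromstring(xml_text)
    metadata = {}
    for key, path in ELEMENT_PATHS.items():
        value = root.findtext(path, namespaces=NAMESPACES)
        # Elements missing from the record are skipped rather than set to None.
        if value is not None:
            metadata[key] = value.strip()
    return metadata


metadata = parse_metadata(SAMPLE)
print(metadata)
```

A dictionary like this could then be passed as `attrs` when constructing an `xarray.DataArray`, or added to an existing object with `data_array.attrs.update(metadata)`.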

Summary of metadata elements

Dataset of series encoding guidelines

I think we should break this down into the following tasks:

s-flint-defra commented 1 year ago

@EFT-Defra I notice there hasn't been much activity on GEMINI metadata recently. Is this due to a lack of time/resource, and so should it stay in the backlog?

EFT-Defra commented 1 year ago

@s-flint-defra, you're correct, there hasn't been any work on this for a while and that's been due to resource constraints. In fact, I don't think there's been any work on this since Jordan left. He summarised where he'd got to here: #54.

I still think this work is important, because valid Gemini metadata files provide a lot of information that might be useful to analysts (like provenance and access / licence restrictions), but there's quite a lot to do and it isn't essential to an MVP.

As well as parsing the individual metadata files, we'll also need to think about how we could combine metadata when combining datasets.
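One possible starting point, sketched below, is to keep only the metadata that the inputs agree on and drop anything that conflicts. This is just one strategy among several, and `combine_metadata` is a hypothetical helper, not something that exists in the repo. xarray offers a similar option through the `combine_attrs="drop_conflicts"` argument to `xarray.merge` and `xarray.concat`.

```python
def combine_metadata(first, second):
    """Merge two metadata dictionaries, dropping keys whose values conflict."""
    combined = dict(first)
    for key, value in second.items():
        if key not in first:
            # Only present in the second dataset: keep it.
            combined[key] = value
        elif first[key] != value:
            # Present in both with different values: drop it.
            del combined[key]
    return combined


# Made-up metadata for two datasets being combined.
a = {"title": "Dataset A", "licence": "OGL"}
b = {"title": "Dataset B", "licence": "OGL", "lineage": "Derived product"}

combined = combine_metadata(a, b)
print(combined)
```

Dropping conflicts is lossy for fields like provenance, where it may be more useful to keep every input's value, so the right rule probably differs per metadata element.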