Open EFT-Defra opened 2 years ago
@EFT-Defra I notice there hasn't been much activity on GEMINI metadata recently. Is this due to lack of time/resource, and so should it stay in the backlog?
@s-flint-defra, you're correct, there hasn't been any work on this for a while and that's been due to resource constraints. In fact, I don't think there's been any work on this since Jordan left. He summarised where he'd got to here: #54.
I still think this work is important, because valid Gemini metadata files provide a lot of information that might be useful to analysts (like provenance and access / licence restrictions), but there's quite a lot to do and it isn't essential to an MVP.
As well as parsing the individual metadata files, we'll also need to think about how we could combine metadata when combining datasets.
Spatial datasets published by the UK government should have associated metadata that conforms to the UK GEMINI standard. Ultimately, we want to parse that metadata into a Python dictionary that can be attached to an xarray DataArray or Dataset via the `attrs` property.
Summary of metadata elements
Dataset of series encoding guidelines
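To make the goal above concrete, here is a minimal sketch of parsing a GEMINI record into a dictionary. It assumes the metadata uses the ISO 19139 XML encoding (the `gmd` and `gco` namespaces) and only uses the standard library. The function name `parse_gemini`, the `ELEMENT_PATHS` mapping, the choice of three elements, and the sample record are all hypothetical and for illustration only, not a description of what the project currently does.

```python
import xml.etree.ElementTree as ET

# Namespaces used by the ISO 19139 encoding that GEMINI records are
# assumed to follow here.
NAMESPACES = {
    "gmd": "http://www.isotc211.org/2005/gmd",
    "gco": "http://www.isotc211.org/2005/gco",
}

# Hypothetical mapping from output keys to element paths. A real parser
# would need to cover many more GEMINI elements than these three.
ELEMENT_PATHS = {
    "title": (
        ".//gmd:MD_DataIdentification/gmd:citation/gmd:CI_Citation"
        "/gmd:title/gco:CharacterString"
    ),
    "abstract": ".//gmd:MD_DataIdentification/gmd:abstract/gco:CharacterString",
    "lineage": ".//gmd:LI_Lineage/gmd:statement/gco:CharacterString",
}

# Made-up record, trimmed to the bare minimum for the example.
SAMPLE_XML = """\
<gmd:MD_Metadata xmlns:gmd="http://www.isotc211.org/2005/gmd"
                 xmlns:gco="http://www.isotc211.org/2005/gco">
  <gmd:identificationInfo>
    <gmd:MD_DataIdentification>
      <gmd:citation>
        <gmd:CI_Citation>
          <gmd:title>
            <gco:CharacterString>Example dataset</gco:CharacterString>
          </gmd:title>
        </gmd:CI_Citation>
      </gmd:citation>
      <gmd:abstract>
        <gco:CharacterString>An illustrative abstract.</gco:CharacterString>
      </gmd:abstract>
    </gmd:MD_DataIdentification>
  </gmd:identificationInfo>
</gmd:MD_Metadata>
"""


def parse_gemini(xml_text: str) -> dict:
    """Return a flat dict of the elements found in a GEMINI XML string."""
    root = ET.fromstring(xml_text)
    metadata = {}
    for key, path in ELEMENT_PATHS.items():
        text = root.findtext(path, namespaces=NAMESPACES)
        # Elements missing from the record are simply left out.
        if text is not None:
            metadata[key] = text.strip()
    return metadata


metadata = parse_gemini(SAMPLE_XML)
```

Because `attrs` on a DataArray or Dataset is a dictionary, the result could then be attached with something like `data_array.attrs.update(metadata)`. Keeping the values as flat strings is a deliberate simplification for this sketch; nested elements such as access and licence constraints would probably need more structure.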
I think we should break this down into the following tasks: