Open ladsmund opened 4 months ago
We should also consider:
pypromice
version, etc., to make the iteration ID deterministic.uuid3
is a hash function that generates a 128-bit number from an input string, designed to be globally unique. The output depends solely on the input string (and namespace) and will always return the same value for the same input. A benefit of using a hash function for the IDs is to control and limit the format of the ID string. This might be especially relevant for point (3).
https://github.com/GEUS-Glaciology-and-Climate/pypromice/pull/252#discussion_r1652118161
Just found this publication that describes a procedure to define and manage attributes in netcdf from observation programs:
Uttal, T., Hartten, L. M., Khalsa, S. J., Casati, B., Svensson, G., Day, J., Holt, J., Akish, E., Morris, S., O'Connor, E., Pirazzini, R., Huang, L. X., Crawford, R., Mariani, Z., Godøy, Ø., Tjernström, J. A. K., Prakash, G., Hickmon, N., Maturilli, M., and Cox, C. J.: Merged Observatory Data Files (MODFs): an integrated observational data product supporting process-oriented investigations and diagnostics, Geosci. Model Dev., 17, 5225–5247, https://doi.org/10.5194/gmd-17-5225-2024, 2024.
We need to update our current processing pipeline to align with the Attribute Convention for Data Discovery (ACDD) 1-3 guidelines. This will improve the consistency, discoverability, and interoperability of our datasets.
The convention has a subset of attributes which are Highly Recommended that we should prioritize to follow.
In addition, I also suggest we maintain a
source
attribute and maybeproduct_version
attribute for reproducability and to determine the need for reprocessing.https://wiki.esipfed.org/Attribute_Convention_for_Data_Discovery_1-3#Index_by_Attribute_Name
dk.geus.promice.station.daily.QAS_Lv3
dk.geus.promice.site.daily.QAS_L