SeismicData / pyasdf

Python Interface to ASDF based on ObsPy
http://seismicdata.github.io/pyasdf/
BSD 3-Clause "New" or "Revised" License
53 stars 30 forks source link

Seis-prov hash IDs #54

Closed mhearne-usgs closed 5 years ago

mhearne-usgs commented 5 years ago

We're starting to use ASDF and seis-prov for some strong motion data processing code we're writing. In the seis-prov documentation (see here: http://seismicdata.github.io/SEIS-PROV/_generated_details.html#seis-prov-ids)

it states that the third element of a seis-prov id is: "A 7 to 12 letter lowercase alphanumeric hash to ensure uniqueness of ids."

What is the intended use of this hash, and what is the domain where the uniqueness is being considered? Is it just within the document, or across a span of them?

krischer commented 5 years ago

What is the intended use of this hash, and what is the domain where the uniqueness is being considered? Is it just within the document, or across a span of them?

The hash is really just needed to be able to generate unique ids for every piece of the provenance information. In an ideal scenario it should of course be globally unique with a central registry but that will not happen any time soon so for now this id should be unique at least per W3C PROV document and also per ASDF file (each ASDF file can contain multiple provenance documents). If you have a way to ensure uniqueness across your whole system, that would be even better.

If you see any issue with type of id please let me know!

krischer commented 5 years ago

No activity in a while. Please reopen if this is still an open question.