daniellecrobinson / Data-Rescue-PDX

Volunteer guide, and other materials for DATA RESCUE PDX
30 stars 6 forks source link

Consider persistent IDs #32

Open mellybelly opened 7 years ago

mellybelly commented 7 years ago

Maybe phase II, but you might consider including persistent IDs where we have them, for example for organizations you can use GRID IDs from Digital Science.

Conversely, as these records get created, what will their persistent IDs be?

max-mapper commented 7 years ago

Good question, I asked the Data.gov about this and they said:

I'd love to discuss UUID's some more. we recommend globally unique IRI's for the required unique identifier on all dataset metadata, and used DOI's as the example, but unfortunately the policy only requires identifiers to be unique within each agency and sadly that's what the majority of them are doing.

We should really set a clear path to transitioning these to UUID's or to providing something like DOIs. There is some spotty use of DOI's in the federal government, and I think LOC mostly uses this Handle server - http://hdl.loc.gov

Data.gov generates UUIDs through CKAN for each dataset but we really only use that for our own purposes

I also have been trying to learn more about PIDs and did a blog post here that raises some questions at the bottom in regards to their use for archival purposes https://datproject.org/blog/2016-11-11-pidapalooza, but I haven't check out GRID IDs, thanks for the tip!

mellybelly commented 7 years ago

see our treatise here: https://zenodo.org/record/163459

On Feb 6, 2017, at 10:00 AM, maxogden notifications@github.com<mailto:notifications@github.com> wrote:

Good question, I asked the Data.govhttp://data.gov about this and they said:

I'd love to discuss UUID's some more. we recommend globally unique IRI's for the required unique identifier on all dataset metadata, and used DOI's as the example, but unfortunately the policy only requires identifiers to be unique within each agency and sadly that's what the majority of them are doing.

We should really set a clear path to transitioning these to UUID's or to providing something like DOIs. There is some spotty use of DOI's in the federal government, and I think LOC mostly uses this Handle server - http://hdl.loc.govhttp://hdl.loc.gov/

Data.govhttp://data.gov generates UUIDs through CKAN for each dataset but we really only use that for our own purposes

I also have been trying to learn more about PIDs and did a blog post here that raises some questions at the bottom in regards to their use for archival purposes https://datproject.org/blog/2016-11-11-pidapalooza, but I haven't check out GRID IDs, thanks for the tip!

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHubhttps://github.com/daniellecrobinson/Data-Rescue-PDX/issues/32#issuecomment-277761493, or mute the threadhttps://github.com/notifications/unsubscribe-auth/AAwn3Kp1Gc_8wAGAN8izG_aeRUXUKnO7ks5rZ1-lgaJpZM4L3hx0.

Melissa Haendel, PhD Associate Professor Library & Dept. of Medical Informatics and Clinical Epidemiology haendel@ohsu.edumailto:haendel@ohsu.edu 503-407-5970 www.monarchinitiative.orghttp://www.monarchinitiative.org

Appointments: Shanez De Silva desilva@ohsu.edumailto:desilva@ohsu.edu

daniellecrobinson commented 7 years ago

Would love to get you both in a room to chat about this!