gigascience / gigadb-website

Source code for running GigaDB
http://gigadb.org
GNU General Public License v3.0
9 stars 15 forks source link

PIDs for individual files in GigaDB #1988

Closed only1chunts closed 3 months ago

only1chunts commented 3 months ago

User story

As a user
I want to be able to point to a unique file within a dataset using a PID So that I can be sure I'm referencing the exact file of interest

Acceptance criteria

Given a file is present in a GigaDB dataset When I refer to that file from an external resource Then I can be sure I am using a globally unique persistent identifier of that file

Additional Info

The main reference point will always be the dataset as an entity, but we should enable a facility for any to specify down to the individual file exactly which file they are referring to. We could mint DOI's through datacite for all files, but I think that is excessive due to the need for a high degree of mandatory metadata, and would therefore complicate matters unessacerily. Instead I would prefer to use something more light-weight such as HANDLES or ARKS, my preference would be ARK, this link explains why.

Moving forwards, adding PIDs to individual files would allow us to then add all the files as metadata objects to the DataCite metadata as "relatedIdentifiers" thus further exposing metadata to a wider indexing and hopefully drive discovery.

Product Backlog Item Ready Checklist

Product Backlog Item Done Checklist

only1chunts commented 3 months ago

closing as this is already a ticket #1582 !