globalbioticinteractions / usnm

Configuration for GloBI to index collections of the National Museum of Natural History, Smithsonian Institution
1 stars 0 forks source link

anecdotal non-resolving ark identifiers in US Natural Museum collection #1

Open jhpoelen opened 3 years ago

jhpoelen commented 3 years ago

In reviewing the USNM datasets during the Parasite Tracker review process, @seltmann and @jhpoelen noticed that some ark ids are resolving, but others are not.

An example of a resolving identifier (see also screenshot):

http://n2t.net/ark:/65665/3abe3bc29-0652-4ae9-9cbc-b4f791c9d9b4

Screenshot from 2020-12-04 13-43-13

An example of a non-resolving identifier (see also screenshot):

http://n2t.net/ark:/65665/3b5bb1f4a-5146-4b6f-9385-7c3b969d5870

Screenshot from 2020-12-04 13-42-54

Both identifiers are known to GloBI and can be found via

https://www.globalbioticinteractions.org/?accordingTo=http%3A%2F%2Fn2t.net%2Fark%3A%2F65665%2F3abe3bc29-0652-4ae9-9cbc-b4f791c9d9b4&interactionType=ecologicallyRelatedTo

Screenshot from 2020-12-04 13-41-09

and

https://www.globalbioticinteractions.org/?accordingTo=http%3A%2F%2Fn2t.net%2Fark%3A%2F65665%2F3b5bb1f4a-5146-4b6f-9385-7c3b969d5870%20&interactionType=ecologicallyRelatedTo

Screenshot from 2020-12-04 13-44-53

jhpoelen commented 3 years ago

According to Karen R. , a museum specialist at the Smithsonian Department of Invertebrate Zoology, the non-resolving ark (archival resource key) like ark:/65665/3a71f5881-42ff-4f05-8cc4-50b7144f6dac have been deleted as part of a de-duplication effort.

The following have been updated as part of a Georeference project

ark:/65665/387f21a69-b629-4330-858f-7ea60e3ca132
ark:/65665/3abe3bc29-0652-4ae9-9cbc-b4f791c9d9b4

Removed from
ark:/65665/3a71f5881-42ff-4f05-8cc4-50b7144f6dac

Just you to give everyone a little background here. As you know, we acquired the USNPC collection and imported the USNPC database into our database. As we have working though curating, etc. of this collection we have come across duplicated records from the import and have retire one of the records.

@seltmann I propose to keep this issue open until we confirm that the updated USNM datasets no longer contain these duplicate records. Also, I imagine that USNM has considered ark redirection for de-duplicated records, but decided not to implement a (complicated) redirection scheme.