CDLUC3 / ezid

Assist UCB to retire resolver (nma.berkeley.edu) #601

Open jsjiang opened 8 months ago

jsjiang commented 8 months ago

Request from UCB:

Name: David Triebwasser
Email: davidtriebwasser@berkeley.edu
Institution: UC Berkeley
Reason for contact: I have a problem or question about existing account
Comment: We want to retire our resolver (nma.berkeley.edu). We would like to meet with a technical contact to discuss our options. We are interested in transitioning all the existing links to EZID if that's possible.

Comment from Dave: UC Berkeley has a number of prefixes including DOIs and ARKs. First step is getting more specifics on the request. Is this for all identifiers or a specific prefix (NAAN)?

jsjiang commented 3 months ago

Shoulders:

Sample ARKs and location URLs: ark:/13030/hb - no minter:

ark:/28722/bk - no minter

jsjiang commented 3 months ago

From Dave: shoulders with record count

28722/bk = 513856
28722/h2 = 161

13030/ft = 10197
13030/k6 = 37951
13030/hb = 14866
13030/kt = 1503
13030/tf = 30757

ark_samples.xlsx
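
To size the request, the counts above can be totalled per NAAN. This is a minimal sketch that only re-adds the numbers listed in this comment; the function and variable names are illustrative and not part of EZID.

```python
# Record counts per shoulder, copied from the list in this comment.
SHOULDER_COUNTS = {
    "28722/bk": 513856,
    "28722/h2": 161,
    "13030/ft": 10197,
    "13030/k6": 37951,
    "13030/hb": 14866,
    "13030/kt": 1503,
    "13030/tf": 30757,
}


def totals_by_naan(counts):
    """Sum record counts per NAAN (the part of the shoulder before the slash)."""
    totals = {}
    for shoulder, count in counts.items():
        naan = shoulder.split("/", 1)[0]
        totals[naan] = totals.get(naan, 0) + count
    return totals


if __name__ == "__main__":
    per_naan = totals_by_naan(SHOULDER_COUNTS)
    print(per_naan)                # {'28722': 514017, '13030': 95274}
    print(sum(per_naan.values()))  # 609291
```

Almost all of the volume sits under 28722/bk, so that shoulder likely dominates any migration effort.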

adambuttrick commented 3 months ago

Mapped to existing shoulder names in EZID. All shoulders appear to be known to us:

| Shoulder | Shoulder Name | Count |
| --- | --- | --- |
| ark:/13030/ft | Online Archive of California (OAC) Legacy ft | 10197 |
| ark:/13030/k6 | UC Berkeley/Digital Special Collections | 37951 |
| ark:/13030/hb | UC Berkeley Legacy Library | 14866 |
| ark:/13030/kt | Online Archive of California (OAC) | 1503 |
| ark:/13030/tf | Online Archive of California (OAC) Legacy tf | 30757 |
| ark:/28722/bk | UC Berkeley Legacy Library bk | 513856 |
| ark:/28722/h2 | UC Berkeley Library - HathiTrust/UCB | 161 |

adambuttrick commented 2 months ago

Additional testing:

| Shoulder | Test Result |
| --- | --- |
| ark:/13030/ft | Sample IDs do not resolve and do not appear to be in EZID. EZID has 416 total records under this shoulder, all of which belong to Merritt. |
| ark:/13030/hb | Sample IDs resolve and all belong to Merritt. |
| ark:/13030/k6 | Sample IDs resolve and all belong to UCB Library. The records do not contain target/location URLs; the links provided are for the EZID records. |
| ark:/13030/kt | Sample IDs resolve and all belong to Merritt. |
| ark:/13030/tf | One ID (ark:/13030/tf00000195) is known to us, resolves, and is owned by Merritt. The rest do not resolve and do not appear to be in EZID. |
| ark:/28722/bk | Sample IDs do not resolve and do not appear to be in EZID. The search timed out, so we were unable to determine the extent of the records. |
| ark:/28722/h2 | Sample IDs resolve and all belong to UCB Library. The records do not contain target/location URLs; the links provided are for the EZID records. |

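
A check like the one summarized above could be scripted roughly as follows. This is a sketch, not the procedure actually used for these tests: it assumes EZID's public lookup endpoint at `https://ezid.cdlib.org/id/<identifier>` returns HTTP 200 with a body starting with `success:` for known identifiers, and the helper names (`http_get`, `in_ezid`, `check_samples`) are made up for illustration.

```python
import urllib.error
import urllib.request

# Assumption: EZID's public "get identifier metadata" endpoint.
EZID_ID_URL = "https://ezid.cdlib.org/id/"


def http_get(url, timeout=10):
    """Return (status, body) for a GET, including for HTTP error responses."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return resp.status, resp.read().decode("utf-8", "replace")
    except urllib.error.HTTPError as err:
        return err.code, err.read().decode("utf-8", "replace")


def in_ezid(ark, fetch=http_get):
    """True if the lookup reports success for this ARK (assumed response format)."""
    status, body = fetch(EZID_ID_URL + ark)
    return status == 200 and body.startswith("success:")


def check_samples(arks, fetch=http_get):
    """Map each sample ARK to whether it appears to be in EZID."""
    return {ark: in_ezid(ark, fetch) for ark in arks}
```

Calling `check_samples(["ark:/13030/tf00000195"])` would issue one request per sample ARK. The `fetch` parameter exists only so the logic can be exercised without network access.
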
jsjiang commented 2 months ago

Shoulder and user relationships:

| prefix | shoulderName | minter | username | userDisplayname |
| --- | --- | --- | --- | --- |
| ark:/13030/ft | Online Archive of California (OAC) Legacy ft | | merritt | CDL UC3 Merritt |
| ark:/13030/hb | UC Berkeley Legacy Library | | ucb-genx | for use by GenX to mint EAD ARKs |
| ark:/13030/hb | UC Berkeley Legacy Library | | merritt | CDL UC3 Merritt |
| ark:/13030/hb | UC Berkeley Legacy Library | | ucrlibrary | UC Riverside Library |
| ark:/13030/hb | UC Berkeley Legacy Library | | ucr-digi | UCR Library - Digital Initiatives |
| ark:/13030/hb | UC Berkeley Legacy Library | | ucblibrary | UC Berkeley Library |
| ark:/13030/k6 | UC Berkeley/Digital Special Collections | https://n2t.net/a/ezid/m/ark/13030/k6 | cdldsc | CDL Digital Special Collections |
| ark:/13030/k6 | UC Berkeley/Digital Special Collections | https://n2t.net/a/ezid/m/ark/13030/k6 | ucb-genx | for use by GenX to mint EAD ARKs |
| ark:/13030/k6 | UC Berkeley/Digital Special Collections | https://n2t.net/a/ezid/m/ark/13030/k6 | merritt | CDL UC3 Merritt |
| ark:/13030/k6 | UC Berkeley/Digital Special Collections | https://n2t.net/a/ezid/m/ark/13030/k6 | ucblibrary | UC Berkeley Library |
| ark:/13030/kt | Online Archive of California (OAC) | | merritt | CDL UC3 Merritt |
| ark:/13030/tf | Online Archive of California (OAC) Legacy tf | | merritt | CDL UC3 Merritt |
| ark:/28722/bk | UC Berkeley Legacy Library bk | | merritt | CDL UC3 Merritt |
| ark:/28722/bk | UC Berkeley Legacy Library bk | | ucblibrary | UC Berkeley Library |
| ark:/28722/h2 | UC Berkeley Library - HathiTrust/UCB | https://n2t.net/a/ezid/m/ark/28722/h2 | ucb-genx | for use by GenX to mint EAD ARKs |
| ark:/28722/h2 | UC Berkeley Library - HathiTrust/UCB | https://n2t.net/a/ezid/m/ark/28722/h2 | ucblibrary | UC Berkeley Library |

    -- List each shoulder with its minter and the users assigned to it.
    SELECT s.prefix,
           s.name AS shoulderName,
           s.minter,
           u.username,
           u.displayName AS userDisplayname
    FROM ezidapp_user u
    JOIN ezidapp_user_shoulders us ON u.id = us.user_id
    JOIN ezidapp_shoulder s ON s.id = us.shoulder_id
    -- Filtering on s.prefix directly is equivalent to the original id subquery.
    WHERE s.prefix IN (
        'ark:/28722/bk',
        'ark:/28722/h2',
        'ark:/13030/ft',
        'ark:/13030/k6',
        'ark:/13030/hb',
        'ark:/13030/kt',
        'ark:/13030/tf'
    )
    ORDER BY s.prefix;

jsjiang commented 2 months ago

Note: we need to assign an owner (user.id) to each ARK.
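
One way to approach this is to pick an owner per shoulder and derive each ARK's owner from its longest matching shoulder prefix. The sketch below assumes that approach; the `HYPOTHETICAL_OWNERS` mapping is purely illustrative (no owner decision has been made in this thread), and the usernames would still need to be resolved to `user.id` values in EZID.

```python
def find_shoulder(ark, shoulders):
    """Return the longest known shoulder that prefixes the ARK, or None."""
    matches = [s for s in shoulders if ark.startswith(s)]
    return max(matches, key=len) if matches else None


def assign_owners(arks, owner_by_shoulder):
    """Split ARKs into {ark: owner} and a list of ARKs with no known shoulder."""
    assigned, unassigned = {}, []
    for ark in arks:
        shoulder = find_shoulder(ark, owner_by_shoulder)
        if shoulder is None:
            unassigned.append(ark)
        else:
            assigned[ark] = owner_by_shoulder[shoulder]
    return assigned, unassigned


# Illustrative only: these shoulder-to-owner choices are NOT decided in this issue.
HYPOTHETICAL_OWNERS = {
    "ark:/13030/tf": "merritt",
    "ark:/28722/bk": "ucblibrary",
}

if __name__ == "__main__":
    assigned, unassigned = assign_owners(["ark:/13030/tf00000195"], HYPOTHETICAL_OWNERS)
    print(assigned, unassigned)
```

ARKs that match no configured shoulder are returned separately so they can be reviewed by hand rather than silently given a default owner.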