rdmpage / index-fungorum-coldp

Index Fungorum as a Catalogue of Life Data Package (ColDP)
0 stars 0 forks source link

Replaced DOIs #1

Open rdmpage opened 2 months ago

rdmpage commented 2 months ago

There are cases such as IF:18054 where the DOI originally assigned to the publications (https://doi.org/10.2307/2438513) has been replaced by a new one as the journal has a new publisher. In this case the DOI is https://doi.org/10.1002/j.1537-2197.1957.tb10560.x. The old JSTOR DOI https://doi.org/10.2307/2438513 still resolves, but as a redirect to the new DOI https://doi.org/10.1002/j.1537-2197.1957.tb10560.x:

HTTP/1.1 301 Moved Permanently

Alt-Svc: h3=":443"; ma=86400
NEL: {"success_fraction":0,"report_to":"cf-nel","max_age":604800}
Server: cloudflare
Report-To: {"endpoints":[{"url":"https:\/\/a.nel.cloudflare.com\/report\/v4?s=mhPeAVT3t3AsOcEstXw%2Fo11cpVtimqvuhH7SL%2B4iY5ZJHvFgIBiWtuuWSVTXLTlsS0j%2BdURkJMvT%2B6bh85ftYuSVFHysRgV%2Bix0nCD5r37Wq9NU%2B6w79dNo%3D"}],"group":"cf-nel","max_age":604800}
Expires: Sun, 21 Jul 2024 09:49:54 GMT
CF-Cache-Status: DYNAMIC
permissions-policy: interest-cohort=(),browsing-topics=()
Location: https://doi.org/10.1002/j.1537-2197.1957.tb10560.x
Date: Sat, 20 Jul 2024 09:49:55 GMT
Strict-Transport-Security: max-age=31536000; includeSubDomains; preload
Content-Length: 0
Connection: keep-alive
CF-RAY: 8a620b42ad226aad-MAN

We need to address these.

rdmpage commented 2 months ago

One reason this matters is that Wikidata may have a record for one DOI but not the other, that is hence we end up failing to make a match between DOI and Wikidata. But if we try and then add the "missing" DOI my tools refuse to do this because there is already an item that matches the bibliographic details (i.e., associated with the other DOI).

rdmpage commented 2 months ago

I've added the fields old_doi and parent_doi to the names table. old_doi is intended to have the older DOI that has since been replaced. parent_doi is intended for cases where we have a publication, say in BHL, where the entire publication has a DOI but we don't have persistent identifiers for individual parts.