gigascience / gigadb-website

Source code for running GigaDB
http://gigadb.org
GNU General Public License v3.0
9 stars 15 forks source link

Bulk update DataCite metadata to keep them up-to-date #549

Open only1chunts opened 3 years ago

only1chunts commented 3 years ago

user story

As a curator I want to bulk update the DataCite metadata for all datasets So that we have up-to-date information for all datasets in DataCite

Acceptance criteria

Given there are a number of Datasets with incorrect entries in DataCite When I run the bulk update job Then all the incorrect DataCite entry are corrected

When #372 and #456 have been done, we then need a script to run over the database and update things in DataCite to the new XML so that we capture as many links as possible in DataCite.

This should include updating the datacite XML for the Ruili plant datasets that were originally registered in DataCite with minimal metadata.

This story depends on:

alli83 commented 1 month ago

@only1chunts it's a script that will only be run once and won't be reused, right?

only1chunts commented 1 month ago

The bulk update, yes, its a one off. If we need to do something similar in the future it will be because either the datacite schema or ours has been updated and/or more things need to be added, so at that point this script wouldn't be terribly useful anyway.

only1chunts commented 1 week ago

FYI - I attended a DataCite training course yesterday, it included some stuff on their API. They have shared the slides here https://zenodo.org/records/14018758 . There were multiple questions from the attendees about how to do bulk updates, and the short answer is that there is no bulk update facility, they suggested writting wrapper scrpits to do multiple individual updates.