clarin-eric / metadata-conversion

Conversion between metadata formats
1 stars 1 forks source link

Datacite: implement datacite - CMDI conversion #9

Closed twagoo closed 4 years ago

twagoo commented 6 years ago

See datacite schema

twagoo commented 6 years ago

Can be carried out in the context of EOSC-hub

twagoo commented 6 years ago

For example, we would like to be able to harvest and import part of https://oai.datacite.org/oai?verb=ListRecords&metadataPrefix=oai_dc&set=DELFT.UU. Note that the partial import (in this case only import the records with Subject and Keywords = Humanities - Languages and literature (6.2) or Humanities - Other humanities (6.5)) requires work on the OAI-PMH harvester.

twagoo commented 5 years ago

https://github.com/clarin-eric/oai-harvest-manager/commit/72fb9dfeb258abf58f81de32e9682cff81a2ed1b

twagoo commented 5 years ago

For example, we would like to be able to harvest and import part of https://oai.datacite.org/oai?verb=ListRecords&metadataPrefix=oai_dc&set=DELFT.UU. Note that the partial import (in this case only import the records with Subject and Keywords = Humanities - Languages and literature (6.2) or Humanities - Other humanities (6.5)) requires work on the OAI-PMH harvester.

Note that the Datacite OAI-PMH endpoint supports arbitrary queries (see docs) which might provide an easier path to solving the partial harvesting problem.

twagoo commented 5 years ago

Request with the desired filter:

> base64('q=*.*&fq=subject:(6.2 OR 6.5)')
cT0qLiomZnE9c3ViamVjdDooNi4yIE9SIDYuNSk=

-> set=DELFT.UU~cT0qLiomZnE9c3ViamVjdDooNi4yIE9SIDYuNSk=

twagoo commented 4 years ago

Records are now in the production VLO. Potential next step: custom conversion to improve quality.