Closed twagoo closed 4 years ago
Can be carried out in the context of EOSC-hub
For example, we would like to be able to harvest and import part of https://oai.datacite.org/oai?verb=ListRecords&metadataPrefix=oai_dc&set=DELFT.UU. Note that the partial import (in this case only import the records with Subject and Keywords = Humanities - Languages and literature (6.2) or Humanities - Other humanities (6.5)) requires work on the OAI-PMH harvester.
For example, we would like to be able to harvest and import part of https://oai.datacite.org/oai?verb=ListRecords&metadataPrefix=oai_dc&set=DELFT.UU. Note that the partial import (in this case only import the records with Subject and Keywords = Humanities - Languages and literature (6.2) or Humanities - Other humanities (6.5)) requires work on the OAI-PMH harvester.
Note that the Datacite OAI-PMH endpoint supports arbitrary queries (see docs) which might provide an easier path to solving the partial harvesting problem.
Request with the desired filter:
> base64('q=*.*&fq=subject:(6.2 OR 6.5)')
cT0qLiomZnE9c3ViamVjdDooNi4yIE9SIDYuNSk=
Records are now in the production VLO. Potential next step: custom conversion to improve quality.
See datacite schema