thedatahub / Datahub-Factory

Datahub::Factory - Transport metadata between Collection Management Systems and the Datahub
Other
2 stars 4 forks source link

Extract XML records from OAI endpoint #24

Closed netsensei closed 7 years ago

netsensei commented 7 years ago

The Catmandu-OAI module allows imports from an OAI endpoint like this:

catmandu convert OAI --url http://collections.britishart.yale.edu/oaicatmuseum/OAIHandler

The OAI input is converted to a JSON object in it's entirety. However, the integrity of the original XML records isn't guaranteed. If we use the raw handler, we can extract the XML records from the OAI enveloppe and deal with them separately:

catmandu convert OAI --url http://collections.britishart.yale.edu/oaicatmuseum/OAIHandler --handler raw

Which would also allow us to separate fetching records from parsing / transforming them to another format (json or yaml)

This issue is relevant for extracting LIDO data and sending it to another data sink.

netsensei commented 7 years ago

Use cases:

netsensei commented 7 years ago

This issue has been fixed.