thedatahub / Datahub-Factory

Datahub::Factory - Transport metadata between Collection Management Systems and the Datahub
Other
2 stars 4 forks source link

Import OAI/LIDO into Datahub doesn't work #49

Open netsensei opened 7 years ago

netsensei commented 7 years ago

The Yale British Center for Arts provides an OAI-PMH endpoint with collection data.

https://britishart.yale.edu/collections/using-collections/technology/harvest-xml-data

This dataset is already transformed into LIDO XML and transforming is not necessary. Just passing it along from the OAI to the Datahub suffices.

Catmandu however, will transform the data into an intermediate, internal JSON structure to appropriate potential transformations. However, the transformation XML > JSON > XML doesn't guarantee that the output will be exactly the same as the input.

Moreover, the intermediate JSON structure isn't suited to be transformed back to LIDO with a simple XML exporter. The resulting XML doesn't resemble LIDO.

So, make it possible to fetch XML data from a source and pass it along to a destination without obligatory transformation.

See: https://github.com/thedatahub/Datahub-Factory-Pipelines/blob/master/OAItoDatahub.ini

pieterdp commented 6 years ago

This can be solved with two minor modifications:

No conversion is done.