wellcomecollection / catalogue-pipeline

:oil_drum: The data pipeline services extracting & transforming data from our museum and collections.
https://developers.wellcomecollection.org/catalogue
MIT License

Harmonise EBSCO and Sierra MatType codes #2572

Closed paul-butcher closed 4 months ago

paul-butcher commented 7 months ago

The materialType code is found in the leader (character 6). However, this is not necessarily the same value as the one in Sierra, which we rely on to define MaterialTypes in the API. (The Sierra documentation states: "However, this data is customizable in Sierra and each organization determines the codes and strings for their system.")

For example, Sierra d means Journal, but d in MARC Leader 06 (https://www.loc.gov/marc/bibliographic/bdleader.html) is "Manuscript notated music". We also have several values of materialType that are not in the official Leader list.

Journals from EBSCO have s in this position, and monographs have m.

We need to map the values found here and/or in the 008 field to the corresponding value that we use from Sierra.

We may also need to do something about the label.
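A minimal sketch of what such a mapping could look like, written in Python purely for illustration and not taken from the pipeline code. The names `MaterialType`, `EBSCO_TO_SIERRA` and `harmonise_material_type` are hypothetical. Only the s to d (Journal) entry comes from the description above; the code and label chosen for m are placeholder assumptions that would need checking against the real Sierra configuration.

```python
from typing import NamedTuple, Optional


class MaterialType(NamedTuple):
    code: str   # Sierra-style code, as used to define MaterialTypes in the API
    label: str  # human-readable label that goes with that code


# Hypothetical lookup table from the code found in an EBSCO record
# to the Sierra-style material type we want to emit.
EBSCO_TO_SIERRA = {
    # From the issue: EBSCO journals carry "s"; Sierra "d" means Journal.
    "s": MaterialType("d", "Journal"),
    # ASSUMPTION: placeholder code and label for monographs, for illustration only.
    "m": MaterialType("a", "Books"),
}


def harmonise_material_type(ebsco_code: str) -> Optional[MaterialType]:
    """Return the Sierra-style material type for an EBSCO code.

    Unknown codes return None rather than being passed through, so that
    a raw MARC value is never mistaken for a Sierra code.
    """
    return EBSCO_TO_SIERRA.get(ebsco_code.strip())
```

Returning `None` for unmapped codes is a deliberate choice in this sketch: a MARC d ("Manuscript notated music") passed through unchanged would otherwise be read as the Sierra d (Journal). Carrying the label alongside the code also covers the label concern in the same lookup.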

Originally posted by @paul-butcher in https://github.com/wellcomecollection/catalogue-pipeline/issues/2563#issuecomment-1978470058

paul-butcher commented 7 months ago

Part of https://github.com/wellcomecollection/platform/issues/5738

kenoir commented 4 months ago

Closing as done.