digirati-co-uk / pmc-lux

Transforming data from PMC catalogues for import to LUX
MIT License

Library – confirm preferred categorisation for HMO and LO #6

Open brutaldigital opened 2 months ago

brutaldigital commented 2 months ago

Rob,

Please can you confirm that the categorisation of HMO as 'texts' is preferable to 'books'?

YCBA appears to use 'books' for HMO and 'texts' for LO.

HMO: https://linked-art.library.yale.edu/node/a67dc503-fcf5-44f0-92b2-cb3cf4a0799e
LO: https://linked-art.library.yale.edu/node/2897515f-2e5e-412c-97a0-f1908a6af87c

azaroth42 commented 2 months ago

YCBA is correct here. If we know that it's a book format of object, then it would be better to have books on the HMO.

tomcrane commented 1 month ago

To confirm: the action for me here is to classify all Library LOs as texts (classified as type of work) and all Library HMOs as books (classified as type of object). Neither is happening at the moment.

tomcrane commented 1 month ago

Actually that's not true - the LOs are being classified according to the value of <medium> using this lookup:

https://github.com/tomcrane/linked-art-net/blob/pmc/LinkedArt/PmcTransformer/Library/Media.cs#L20-L35

tomcrane commented 1 month ago

So <medium>Text</medium> results in the LO being classified as "Text" (https://vocab.getty.edu/aat/300263751). There are 25167 of these, plus lots of other media values.

But there is no classification of HMOs. It seems like at least some of these classifications should be reallocated to the HMOs, e.g. DVD -> type of object?

tomcrane commented 1 month ago

Leave open because we might get new values and rules - might need a combination of medium and class to determine classification.
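
One way the "combination of medium and class" idea could look is a two-step lookup: try a rule keyed on both values first, then fall back to the medium-only rule. This is only a sketch under assumptions. `CombinedLookupSketch`, `Resolve`, and both table parameters are hypothetical names, classifications are shown as plain string labels rather than the project's real types, and no actual medium/class rules are implied.

```csharp
#nullable enable
using System.Collections.Generic;

// Hypothetical helper, not part of the PmcTransformer code.
public static class CombinedLookupSketch
{
    // Returns (LO classification label, HMO classification label).
    // A null entry means "leave that entity unclassified".
    public static (string? Lo, string? Hmo) Resolve(
        string medium,
        string? cls,
        IReadOnlyDictionary<(string Medium, string Class), (string? Lo, string? Hmo)> byMediumAndClass,
        IReadOnlyDictionary<string, (string? Lo, string? Hmo)> byMedium)
    {
        // 1. Prefer the more specific rule keyed on both <medium> and <class>.
        if (cls is not null && byMediumAndClass.TryGetValue((medium, cls), out var specific))
        {
            return specific;
        }

        // 2. Otherwise fall back to the existing medium-only lookup.
        if (byMedium.TryGetValue(medium, out var general))
        {
            return general;
        }

        // 3. Unknown medium: classify nothing rather than guess.
        return (null, null);
    }
}
```

Passing the two tables in as parameters keeps the rules themselves out of the code, so new values and rules can be added without touching the lookup logic.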

brutaldigital commented 1 month ago

Emma has made updates to the CLASS fields to better identify the medium. Revised values are:

<class>AUDIO VISUAL - SOFTWARE</class>
<class>AUDIO VISUAL - VIDEO</class>
<class>AUDIO VISUAL - AUDIO</class>

// to be applied to:                     (the LinguisticObject, the HumanMadeObjects)
MediaDict[InformationFiles.Label!]    = (Text, InformationFiles);
MediaDict[Text.Label!]                = (Text, Getty.Book);
MediaDict[ExhibitionCatalogue.Label!] = (ExhibitionCatalogue, Getty.Book);
MediaDict[AuctionCatalogue.Label!]    = (AuctionCatalogue, Getty.Book);
MediaDict[Journal.Label!]             = (Journal, null);
MediaDict[CDRom.Label!]               = (null, CDRom);
MediaDict[Website.Label!]             = (Website, Website);
MediaDict[Manuscript.Label!]          = (Text, Manuscript);
MediaDict[DVD.Label!]                 = (null, DVD);
MediaDict[Microfilm.Label!]           = (null, Microfilm);
MediaDict[CD.Label!]                  = (Getty.Sound, CD);
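
To show how a table of this shape might be consumed, here is a minimal self-contained sketch. It assumes a `null` in either slot means "apply no classification to that entity". The `Classification` record, the `MediaLookupSketch` class, and the `Classify` method are hypothetical stand-ins rather than the real linked-art-net types, and only three of the entries above are reproduced.

```csharp
#nullable enable
using System.Collections.Generic;

// Hypothetical stand-in for the project's classification type.
public record Classification(string Label);

public static class MediaLookupSketch
{
    private static readonly Classification Text = new("Text");
    private static readonly Classification Book = new("Book");
    private static readonly Classification Journal = new("Journal");
    private static readonly Classification DVD = new("DVD");

    // <medium> label -> (classification for the LinguisticObject,
    //                    classification for the HumanMadeObject)
    private static readonly Dictionary<string, (Classification? Lo, Classification? Hmo)> MediaDict = new()
    {
        [Text.Label] = (Text, Book),
        [Journal.Label] = (Journal, null),
        [DVD.Label] = (null, DVD),
    };

    // Looks up both classifications for a <medium> value.
    // Unknown or missing media yield (null, null), i.e. nothing is classified.
    public static (Classification? Lo, Classification? Hmo) Classify(string? medium)
    {
        if (medium is not null && MediaDict.TryGetValue(medium.Trim(), out var pair))
        {
            return pair;
        }
        return (null, null);
    }
}
```

With this shape, `Classify("Text")` gives the LO a "Text" classification and the HMO a "Book" classification, while `Classify("DVD")` classifies only the HMO.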