pulibrary / figgy

Valkyrie-based digital repository backend.
Other
35 stars 4 forks source link

Ingest "Arabic movie posters" (pudl0100) #1612

Closed tpendragon closed 4 years ago

tpendragon commented 5 years ago

Notes: MODS

Details: https://docs.google.com/spreadsheets/d/1f-c3goE0CC2QJ0Cbk_-JN67nk3fpI9J2Dl37AhGIzos/edit#gid=33 Depends on #1674 Depends on #1716

tpendragon commented 5 years ago

Files: https://drive.google.com/drive/u/0/folders/0B4Wo5hgOEFY3ZlZGUU5OS3FIbDA

tpendragon commented 5 years ago

http://pudl.princeton.edu/collections/pudl0100

jrgriffiniii commented 5 years ago

There are three different filename prefixes for the METS files within the directory posters:

These seem to be linked to the countries of origin (Lebanon, Egypt, and Iraq), but for Lebanon no placeTerm is provided.

Within the directory lc, (for "Lobby cards" as opposed to "posters") the files continue for Lebanon, Egypt, and Syria with some additional prefixes:

jrgriffiniii commented 5 years ago

The MODS metadata within the METS files follows this structure:

Field Element Language Script XPath Example Authorities/Encoding Standards
Title titleInfo US English Latin mods:mods/mods:titleInfo/mods:title The Golden Age of Comedy  
Title titleInfo Arabic Arabic mods:mods/mods:titleInfo/mods:title ابن مين المجتمع  
Resource Type typeOfResource US English Latin mods:mods/mods:typeOfResource still image  
Genre genre US English Latin mods:mods/mods:genre Posters AAT
Date Created dateCreated N/A N/A mods:mods/mods:originInfo/mods:dateCreated 1901-2000  
Language languageTerm N/A N/A mods:mods/mods:language/mods:languageTerm ara ISO639-2b
Extent extent US English Latin mods:mods/mods:physicalDescription/mods:extent 1 piece.  
Collection collection US English Latin mods:mods/mods:relatedItem[@type="host"]/mods:titleInfo/mods:title Middle Eastern Film Posters Digitization Initiative  
Identifier identifier N/A N/A mods:mods/mods:identifier le1-0143  
Use Rights accessCondition US English Latin mods:mods/mods:accessCondition[@type="useAndReproduction"] http://www.princeton.edu/~rbsc/research/rights.html  
Access Restrictions accessCondition US English Latin mods:mods/mods:accessCondition[@type="restrictionOnAccess"] http://www.princeton.edu/~rbsc/research/rules.html  
Actor namePart Arabic Arabic mods:mods/mods:name/mods:roleTerm[text()="act"]/../mods:namePart نعيم حلاوي  
Director namePart Arabic Arabic mods:mods/mods:name/mods:roleTerm[text()="drt"]/../mods:namePart شادي حنا  
Place place US English Latin mods:mods/mods:originInfo/mods:place/mods:placeTerm Iraq  
jrgriffiniii commented 5 years ago

There is additional MODS metadata describing the physical location of these items which might be needed for any MARC records derived from this:

   <mods:location>
      <mods:physicalLocation type="text">Princeton University Library. Firestone Library.</mods:physicalLocation>
      <mods:physicalLocation authority="marcorg" type="code">NjP</mods:physicalLocation>
      <mods:holdingSimple>
         <mods:copyInformation>
            <mods:subLocation>Curator's office</mods:subLocation>
            <mods:shelfLocator>Cabinet 11/13</mods:shelfLocator>
         </mods:copyInformation>
      </mods:holdingSimple>
   </mods:location>

These values could be mapped to attributes such as holding_location or location.

tpendragon commented 5 years ago

The metadata for all these ingested in prod, they're now generating derivatives.

tpendragon commented 5 years ago

Emailed Kim Leaman to get confirmation that the items came over okay.

tpendragon commented 5 years ago

Requests/feedback from Kim:

escowles commented 5 years ago

These should be ingested as Ephemera, not as ScannedResources

tpendragon commented 5 years ago

:|

escowles commented 4 years ago

@kelea99 got approval to complete these as ScannedResources instead of reingesting as ephemera. I've bulk completed them.

escowles commented 4 years ago

I was starting to work up a Rake task to validate that METS files were ingested into Figgy, and noticed none of these objects were found because neither the replaces nor the ARKs were migrated. Reopening this ticket to track adding both of those.

escowles commented 4 years ago

Closing in favor of #3252