crkn-rcdr / cihm-metadatabus

Documentation and Docker build environment for key portions of the metadata bus.
BSD 2-Clause "Simplified" License
0 stars 1 forks source link

More convenient handling of ocr for items with multiple manifests per language #100

Closed brittnylapierre closed 2 months ago

brittnylapierre commented 4 months ago

We don't need to run them through abby separately but do need to run OCR import jobs separately because they are different manifests in the system - as long as pdf and xml the filenames match "b1603413E" and "b1603413F" for the respective manifest.

So you could use the same files abby produced by renaming them.

Adding this note to the metadatabus for now that this is how multilingual texts are handled so I don't forget to see if this can be made easier either in our script or in any software that replaces it~