mnylc / islandora_multi_importer

This is a flexible, twig based, all cmodel, tabular data to islandora Object importer with optional ZeroMQ processing
GNU General Public License v3.0
16 stars 15 forks source link

ISSUE-119: One PDF to rule them all (not all really just the Book) #124

Closed DiegoPino closed 3 years ago

DiegoPino commented 4 years ago

What is this?

After cleaning up all what i thought was actually working (and not) and committing too many silly pieces of code the PDF to TIFF to pages auto-magic processor is done. (may have more bugs but nothing we can not fix during daylight and some good coffee)

What/how this work?

Paged CMODELS now have an extra mapping option for their OBJ datastream )(scroll down on the list when doing the CMODEL mapping). This option allows the page to a Parent Object's (Book/Newspaper Issue) assigned PDF file (in the column of your choice). If you actually create all the rows with Pages, IMI will respect those and only extract from PDF to tiff the sequences stated in each Sequence column. But. If you care little about page level metadata or you have all the same in each, create a single Page with a sequence number of 0. That is the secret command that tells archipelago to create the Pages for you.

And it works. Yes. Please send coffee!