MarcusBarnes / islandora_compound_batch

Provides the basic ability to batch import compound objects into Islandora.
GNU General Public License v3.0
3 stars 12 forks source link

"Defer derivative generation during ingest" incompatible with islandora_compound_batch? #43

Open TristanSmithlib opened 5 years ago

TristanSmithlib commented 5 years ago

Use case: I'd like to do all of my derivative generation before I submit my batch to Islandora so that I can accelerated ingest rates.

Problem: When I check "Defer derivative generation during ingest" on /admin/islandora/configure and I create a batch set using islandora_compound_batch with the resulting batch when ingested contains empty objects only containing MODS, DC, and RELS-EXT. The objects don't even contain the TIF OBJ that was submitted!

If however when I uncheck "Defer derivative generation during ingest" on /admin/islandora/configure the resulting objects when ingested contains all of the appropriate datastreams, including the OBJ. My sense is though that those datastreams have been generated by Islandora and thus the versions that I pregenerated are not actually taken.

For both example cases above islandora_compound_batch was pointing at a directory full of object folders containing the appropriate datastreams with respective file names. Example:

./smith_ssc_324_digital_object_323
./smith_ssc_324_digital_object_323/structure.xml
./smith_ssc_324_digital_object_323/MODS.xml
./smith_ssc_324_digital_object_323/OCR.txt
./smith_ssc_324_digital_object_323/TN.jpg
./smith_ssc_324_digital_object_323/00001
./smith_ssc_324_digital_object_323/00001/JPG.jpg
./smith_ssc_324_digital_object_323/00001/JP2.jp2
./smith_ssc_324_digital_object_323/00001/MODS.xml
./smith_ssc_324_digital_object_323/00001/TN.jpg
./smith_ssc_324_digital_object_323/00001/OBJ.tif
./smith_ssc_324_digital_object_323/00002
./smith_ssc_324_digital_object_323/00002/JPG.jpg
./smith_ssc_324_digital_object_323/00002/JP2.jp2
./smith_ssc_324_digital_object_323/00002/MODS.xml
./smith_ssc_324_digital_object_323/00002/TN.jpg
./smith_ssc_324_digital_object_323/00002/OBJ.tif
./smith_ssc_324_digital_object_323/00003
./smith_ssc_324_digital_object_323/00003/JPG.jpg
./smith_ssc_324_digital_object_323/00003/JP2.jp2
./smith_ssc_324_digital_object_323/00003/MODS.xml
./smith_ssc_324_digital_object_323/00003/TN.jpg
./smith_ssc_324_digital_object_323/00003/OBJ.tif
...

Here are my exact commands:

drush -v --user=compass_admin islandora_compound_batch_preprocess --scan_target=/mnt/ingest/smith/compound-large-image-sample --namespace=test --parent=smith:test
drush -v --user=1 islandora_batch_ingest --ingest_set=774

I can send you a sample ingest directory if needed.

Possible desired outcomes:

  1. When I check "Defer derivative generation during ingest" take the supplied files and insert them into their respective datastreams (like the book object) especially the OBJ
  2. Add a configuration option for islandora_compound_batch like the book and newspaper solution packs that allows disabling for just the compound object type.

Assumptions: I've only tried this with large image objects using TIF files as the OBJ. I'm assuming that this is an issue for other child object types.