Closed mjordan closed 7 years ago
Looking at this now I can't recall why I split out newspapers and books. I think a more general option would be fine:
--allow_page_files=OCR.txt,JP2.jp2
There is already a list of optional but allowed files:
$allowed_ds_files = array(
'MODS.xml',
'DC.xml',
'JP2.jp2',
'JPEG.jpg',
'TN.jpg',
'TN.png',
'OCR.asc',
'OCR.txt',
'HOCR.shtml'
);
So we don't need an option. Closing.
Islandora Batch modules for newspapers and books allow the presence of page-level files besides the OBJ and MODS datastreams, e.g. JP2, OCR, etc. Currently, iipqa tests would fail if these files were present.
We should allow for a specific list of filenames to be present in directories that are being checked. Perhaps a command-line option like
--allow_for_books=OCR.txt,JP2.jp2
--allow_for_newspapers=OCR.txt