Dev Effort
1D - investigation
Description
A large number of our PDFs trigger a Java exception when JHOVE attempts to validate them. An example can be downloaded from http://gac.canadiana.ca/view/ooe.b4222507_008 (the Download PDF button is beside the image resize - + buttons).
Note: pdfinfo is from poppler-utils, and identify is from ImageMagick. identify renders every page of a PDF to an image, which is how it checks that a PDF file is working, and it is able to do so for these files. The PDF files in question also render in every PDF viewer we have tested.
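For anyone trying to reproduce the comparison, here is a minimal sketch of running the three tools over one file. It assumes `pdfinfo`, `identify`, and a `jhove` launcher script are all on the PATH, and that the JHOVE PDF module is selected with `-m PDF-hul`. The helper names `run_check` and `cross_check` are made up for this example and are not part of any of these tools.

```python
import subprocess
import sys


def run_check(command, timeout=120):
    """Run one command and return (succeeded, combined stdout/stderr)."""
    try:
        result = subprocess.run(
            command, capture_output=True, text=True, timeout=timeout
        )
    except FileNotFoundError:
        # The tool itself is missing, which is different from a bad PDF.
        return False, f"{command[0]}: not installed or not on PATH"
    except subprocess.TimeoutExpired:
        return False, f"{command[0]}: timed out after {timeout}s"
    return result.returncode == 0, (result.stdout + result.stderr).strip()


def cross_check(pdf_path):
    """Run pdfinfo, identify, and JHOVE against the same PDF file."""
    checks = {
        "pdfinfo": ["pdfinfo", pdf_path],    # poppler-utils
        "identify": ["identify", pdf_path],  # ImageMagick, renders each page
        # Assumption: a `jhove` wrapper script is installed and on the PATH.
        "jhove": ["jhove", "-m", "PDF-hul", pdf_path],
    }
    return {name: run_check(cmd) for name, cmd in checks.items()}


if __name__ == "__main__" and len(sys.argv) > 1:
    for name, (ok, output) in cross_check(sys.argv[1]).items():
        print(f"== {name}: exit status {'zero' if ok else 'non-zero'}")
        print(output)
```

The exit status alone may not reflect what JHOVE concluded about the file, so the printed output (including any Java stack trace) is the part worth attaching to a report.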
This issue was also discussed on the JHOVE mailing list. A couple thousand PDF files in our repository produce a similar report and may have the same problem.
If the problem turns out to be with the PDF file and not JHOVE, could someone with more knowledge of the PDF file format document how it is broken, so that a report can be sent to https://poppler.freedesktop.org/ (and possibly other projects; I haven't checked which tools generated all the PDF files that JHOVE is flagging)?