dulibrarytech / digitaldu-frontend

Digital Collections DU front end
Apache License 2.0

Pdfjs - Loading damaged pdf causes crash #412

Closed: jrynhart closed this issue 2 months ago

jrynhart commented 2 months ago

The log shows this warning just before the app crashes: "Warning: Indexing all pdfs"

pdfjs may be throwing this; investigate how to handle the error.

jrynhart commented 2 months ago

The message "Warning: Indexing all pdfs" is the last log entry before each app restart (this occurred 4 times). The message can be reproduced by loading the object that was loaded just before the message appeared.

Example: 943b0b55-c968-4c46-8771-7e565d78a121. This is a TIFF, not a PDF. The object returns a 404 when loaded by the Universal Viewer (UV), which may be causing an internal problem. The UV PDF library (pdf.combined.js) is what logs the message.

This TIFF is not found in DuraCloud (404). The JPG derivative is not present in the storage folder.

The crash has not been reproduced. Testing with PDF files has reproduced neither the message nor the crash.

Will monitor for further occurrences and potentially upgrade UV to v4.0.25 (currently v3.1.4).

jrynhart commented 2 months ago

Can reproduce the crash by loading a corrupted PDF file.

Found an exception thrown from the Node library chain page-count > pdf-parse > pdf.js. Handled the exception in the pdfUtils.js lib. Ref#3ce3091ba27d6263303b8684a14f1846f08ab3e8
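
For readers hitting the same problem, here is a minimal sketch of the general idea: wrap the page-count call so that a parser failure becomes a returned error value instead of an uncaught exception. This is not the code from the referenced commit. The name `safePageCount`, the injected `parsePdf` parameter, and the shape of the returned object are all assumptions made for illustration. The `numpages` field is assumed to be what the parser resolves with.

```javascript
// Hypothetical helper, NOT the project's actual pdfUtils.js code.
// `parsePdf` is any function that takes a Buffer and returns (or resolves to)
// an object with a numeric `numpages` field. It is injected so the sketch
// does not depend on a specific library or version.
async function safePageCount(buffer, parsePdf) {
  try {
    // `await` inside `try` catches both synchronous throws and promise rejections.
    const data = await parsePdf(buffer);
    if (!data || !Number.isInteger(data.numpages)) {
      return { ok: false, pages: null, error: 'parser returned no page count' };
    }
    return { ok: true, pages: data.numpages, error: null };
  } catch (err) {
    // A corrupted file lands here instead of taking down the process.
    const message = err instanceof Error ? err.message : String(err);
    return { ok: false, pages: null, error: message };
  }
}
```

A caller would check `ok` and fall back (for example, skip the page count) when it is `false`. Note that this pattern only covers errors that surface through the call itself or its promise. An error thrown later from an unrelated callback inside a library would need separate handling.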

jrynhart commented 2 months ago

Verified that the exception no longer causes the application to crash and stop running.