daitss / core

DAITSS: Dark Archive In The Sunshine State
GNU General Public License v3.0
9 stars 2 forks source link

Running into exception error with pdf #785

Closed szanati closed 7 years ago

szanati commented 7 years ago

I received the following error for a package:

error while processing 1(sip-files/03-2013.pdf): bad status http://describe.fda.fcla.edu/describe?location=file:/var/daitss/data/work/EWFIOLKP4_0Q4VCV/files/original/1/data&uri=info%3Afda%2FEWFIOLKP4_0Q4VCV%2Ffile%2F1&originalName=sip-files%2F03-2013.pdf: 500 running into exception running into exception ClassCastException 'edu.harvard.hul.ois.jhove.module.pdf.PdfDictionary cannot be cast to edu.harvard.hul.ois.jhove.module.pdf.PdfStream' while processing 32 bytes of input /opt/web-services/sites/describe/releases/20160607000001/lib/format/formatbase.rb:33:in method_missing'\n/opt/web-services/sites/describe/releases/20160607000001/lib/format/formatbase.rb:33:inextract'\n/opt/web-services/sites/describe/releases/20160607000001/lib/formatpool.rb:79:in block in extractAll'\n/opt/web-services/sites/describe/releases/20160607000001/lib/formatpool.rb:71:ineach'\n/opt/web-services/sites/describe/releases/20160607000001/lib/formatpool.rb:71:in extractAll'\n/opt/web-services/sites/describe/releases/20160607000001/lib/formatpool.rb:34:indescribe'\n/opt/web-services/sites/describe/releases/20160607000001/app.rb:151:in block in '\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:1603:incall'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:1603:in block in compile!'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:966:in[]'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:966:in block (3 levels) in route!'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:985:inroute_eval'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:966:in block (2 levels) in route!'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:1006:inblock in process_route'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:1004:in catch'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:1004:inprocess_route'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:964:in block in route!'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:963:ineach'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:963:in route!'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:1076:inblock in dispatch!'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:1058:in block in invoke'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:1058:incatch'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:1058:in invoke'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:1073:indispatch!'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:898:in block in call!'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:1058:inblock in invoke'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:1058:in catch'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:1058:ininvoke'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:898:in call!'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:886:incall'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/rack-1.5.2/lib/rack/commonlogger.rb:33:in call'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:217:incall'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/rack-protection-1.5.3/lib/rack/protection/xss_header.rb:18:in call'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/rack-protection-1.5.3/lib/rack/protection/path_traversal.rb:16:incall'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/rack-protection-1.5.3/lib/rack/protection/json_csrf.rb:18:in call'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/rack-protection-1.5.3/lib/rack/protection/base.rb:49:incall'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/rack-protection-1.5.3/lib/rack/protection/base.rb:49:in call'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/rack-protection-1.5.3/lib/rack/protection/frame_options.rb:31:incall'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/rack-1.5.2/lib/rack/nulllogger.rb:9:in call'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/rack-1.5.2/lib/rack/head.rb:11:incall'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/rack-1.5.2/lib/rack/methodoverride.rb:21:in call'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:180:incall'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:2014:in call'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:1478:inblock in call'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:1788:in synchronize'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:1478:incall'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/thin-1.6.2/lib/thin/connection.rb:86:in block in pre_process'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/thin-1.6.2/lib/thin/connection.rb:84:incatch'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/thin-1.6.2/lib/thin/connection.rb:84:in pre_process'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/thin-1.6.2/lib/thin/connection.rb:53:inprocess'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/thin-1.6.2/lib/thin/connection.rb:39:in receive_data'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/eventmachine-1.0.3/lib/eventmachine.rb:187:inrun_machine'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/eventmachine-1.0.3/lib/eventmachine.rb:187:in run'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/thin-1.6.2/lib/thin/backends/base.rb:73:instart'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/thin-1.6.2/lib/thin/server.rb:162:in start'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/thin-1.6.2/lib/thin/controllers/controller.rb:87:instart'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/thin-1.6.2/lib/thin/runner.rb:199:in run_command'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/thin-1.6.2/lib/thin/runner.rb:155:inrun!'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/thin-1.6.2/bin/thin:6:in '\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/bin/thin:23:inload'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/bin/thin:23:in ' while processing sip-files/03-2013.pdf /opt/web-services/sites/describe/releases/20160607000001/lib/format/formatbase.rb:55:inrescue in extract'\n/opt/web-services/sites/describe/releases/20160607000001/lib/format/formatbase.rb:32:in extract'\n/opt/web-services/sites/describe/releases/20160607000001/lib/formatpool.rb:79:inblock in extractAll'\n/opt/web-services/sites/describe/releases/20160607000001/lib/formatpool.rb:71:in each'\n/opt/web-services/sites/describe/releases/20160607000001/lib/formatpool.rb:71:inextractAll'\n/opt/web-services/sites/describe/releases/20160607000001/lib/formatpool.rb:34:in describe'\n/opt/web-services/sites/describe/releases/20160607000001/app.rb:151:inblock in '\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:1603:in call'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:1603:inblock in compile!'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:966:in []'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:966:inblock (3 levels) in route!'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:985:in route_eval'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:966:inblock (2 levels) in route!'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:1006:in block in process_route'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:1004:incatch'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:1004:in process_route'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:964:inblock in route!'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:963:in each'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:963:inroute!'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:1076:in block in dispatch!'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:1058:inblock in invoke'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:1058:in catch'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:1058:ininvoke'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:1073:in dispatch!'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:898:inblock in call!'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:1058:in block in invoke'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:1058:incatch'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:1058:in invoke'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:898:incall!'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:886:in call'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/rack-1.5.2/lib/rack/commonlogger.rb:33:incall'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:217:in call'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/rack-protection-1.5.3/lib/rack/protection/xss_header.rb:18:incall'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/rack-protection-1.5.3/lib/rack/protection/path_traversal.rb:16:in call'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/rack-protection-1.5.3/lib/rack/protection/json_csrf.rb:18:incall'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/rack-protection-1.5.3/lib/rack/protection/base.rb:49:in call'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/rack-protection-1.5.3/lib/rack/protection/base.rb:49:incall'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/rack-protection-1.5.3/lib/rack/protection/frame_options.rb:31:in call'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/rack-1.5.2/lib/rack/nulllogger.rb:9:incall'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/rack-1.5.2/lib/rack/head.rb:11:in call'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/rack-1.5.2/lib/rack/methodoverride.rb:21:incall'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:180:in call'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:2014:incall'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:1478:in block in call'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:1788:insynchronize'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/sinatra-1.4.5/lib/sinatra/base.rb:1478:in call'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/thin-1.6.2/lib/thin/connection.rb:86:inblock in pre_process'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/thin-1.6.2/lib/thin/connection.rb:84:in catch'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/thin-1.6.2/lib/thin/connection.rb:84:inpre_process'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/thin-1.6.2/lib/thin/connection.rb:53:in process'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/thin-1.6.2/lib/thin/connection.rb:39:inreceive_data'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/eventmachine-1.0.3/lib/eventmachine.rb:187:in run_machine'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/eventmachine-1.0.3/lib/eventmachine.rb:187:inrun'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/thin-1.6.2/lib/thin/backends/base.rb:73:in start'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/thin-1.6.2/lib/thin/server.rb:162:instart'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/thin-1.6.2/lib/thin/controllers/controller.rb:87:in start'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/thin-1.6.2/lib/thin/runner.rb:199:inrun_command'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/thin-1.6.2/lib/thin/runner.rb:155:in run!'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/gems/thin-1.6.2/bin/thin:6:in'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/bin/thin:23:in load'\n/opt/web-services/sites/describe/shared/bundle/ruby/1.9.1/bin/thin:23:in '

trace

/opt/web-services/sites/core/releases/20160607000001/lib/daitss/proc/wip/preserve.rb:26:in rescue in block in preserve' /opt/web-services/sites/core/releases/20160607000001/lib/daitss/proc/wip/preserve.rb:19:inblock in preserve' /opt/web-services/sites/core/releases/20160607000001/lib/daitss/proc/wip/preserve.rb:18:in each' /opt/web-services/sites/core/releases/20160607000001/lib/daitss/proc/wip/preserve.rb:18:inpreserve' /opt/web-services/sites/core/releases/20160607000001/lib/daitss/proc/wip/ingest.rb:33:in ingest' /opt/web-services/sites/core/releases/20160607000001/lib/daitss/proc/wip/process.rb:82:inblock in spawn' /opt/web-services/sites/core/releases/20160607000001/lib/daitss/proc/wip/process.rb:66:in fork' /opt/web-services/sites/core/releases/20160607000001/lib/daitss/proc/wip/process.rb:66:inspawn' /opt/web-services/sites/core/current/bin/pulse:161:in block in start_wips' /opt/web-services/sites/core/current/bin/pulse:158:ineach' /opt/web-services/sites/core/current/bin/pulse:158:in start_wips' /opt/web-services/sites/core/current/bin/pulse:194:inblock in ' /opt/web-services/sites/core/current/bin/pulse:192:in loop' /opt/web-services/sites/core/current/bin/pulse:192:in '

I reset and it gives the same error. I tried it on Ripple and got the same error.

szanati commented 7 years ago

I have added this github issue to the Projectplace card for Carol

lydiam commented 7 years ago

Testing email notifications from DAITSS issues.

cchou commented 7 years ago

This package fails while processing the PDF, 03-2013.pdf, in the package. I have put the problem file at http://www.fcla.edu/daitss-test/files/03-2013.pdf. The exception is raised inside JHOVE.

cchou commented 7 years ago

I have installed the latest JHOVE, version 1.14 build on 05-12-2016, from openpreserve who is currently maintaining JHOVE, but JHOVE still chokes with the same error.

I have filed this issue to openpreserve, https://github.com/openpreserve/jhove/issues/173, hopefully to hear back from them soon.

cchou commented 7 years ago

Openpreserve has not responded about this issue, so it doesn't look like we can depend on them.

I have gone ahead and in fixed up JHOVE code. The new fixed code have been rolled out to ripple test server. The next step is for production staff to do the user testing.

szanati commented 7 years ago

A fix for this issues was rolled out on 4-6-17. The package with the problem pdf archived.