sul-dlss-deprecated / was-thumbnail-service

Rails app to create and serve thumbnail images for web archiving seed URIs.

issue capturing thumbnail for some sites: "#FAIL# Unable to load the address! with HTTP status: 200, HTTP message: OK" #216

Closed jmartin-sul closed 4 years ago

jmartin-sul commented 4 years ago

see https://app.honeybadger.io/projects/51217/faults/43409116

[PROJECT_ROOT]/lib/was/thumbnail_service/capture/capture_thumbnail.rb:21 :in `process_thumbnail`
[PROJECT_ROOT]/lib/was/thumbnail_service/capture/capture_job.rb:11 :in `perform`
[GEM_ROOT]/gems/delayed_job-4.1.8/lib/delayed/backend/base.rb:81 :in `block in invoke_job`
[GEM_ROOT]/gems/delayed_job-4.1.8/lib/delayed/lifecycle.rb:79 :in `block (2 levels) in add`
[GEM_ROOT]/gems/delayed_job-4.1.8/lib/delayed/lifecycle.rb:61 :in `block in initialize`
[GEM_ROOT]/gems/delayed_job-4.1.8/lib/delayed/lifecycle.rb:79 :in `block in add` 

Describe the bug

thumbnail capture for some websites fails with this error: #FAIL# Unable to load the address! with HTTP status: 200, HTTP message: OK

e.g.

RuntimeError: Thumbnail for memento https://swap.stanford.edu/20170705003503/http://www.thenationalteapartyfederation.com/ can't be generated. #FAIL# Unable to load the address! with HTTP status: 200, HTTP message: OK 

the most recent failures:

clarification from andrew: the thumbnail capture is hitting SWAP and thumbnailing our archived representation, not hitting the live site on the general web
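
To make that concrete, here is a minimal sketch of how a memento URL like the one in the error above breaks down into a capture timestamp and the original seed URI. `parse_memento` and `MEMENTO_PATTERN` are hypothetical names, not part of was-thumbnail-service, and the 14-digit timestamp format is an assumption based on the single example URL in this issue.

```ruby
# Hypothetical helper (not from the was-thumbnail-service codebase).
# Assumes memento URLs look like the example in this issue:
#   https://swap.stanford.edu/<14-digit timestamp>/<original URI>
MEMENTO_PATTERN = %r{\Ahttps://swap\.stanford\.edu/(\d{14})/(.+)\z}

# Returns a hash with the capture timestamp and the original seed URI,
# or nil when the URL does not match the assumed format.
def parse_memento(memento_url)
  match = MEMENTO_PATTERN.match(memento_url)
  return nil unless match

  { timestamp: match[1], original_uri: match[2] }
end

parts = parse_memento(
  'https://swap.stanford.edu/20170705003503/http://www.thenationalteapartyfederation.com/'
)
puts parts[:timestamp]
puts parts[:original_uri]
```

The request that fails is the one to the full swap.stanford.edu URL; the original URI only appears as a path component of it.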

(thanks to one of our info sec channels for showing me urlscan -- i'd be a bit hesitant about hitting the first two of those websites from my own browser these days 😆)

User Impact

automatic thumbnail capture does not work reliably for all websites.

Screenshots

n/a, see HB links and error text

andrewjbtw commented 4 years ago

I just noticed that this ticket is in the was-thumbnail repo. The way things are named in the WAS workflows is confusing. This issue should be in was_robot_suite since thumbnail-generator is a robot step.

The was-thumbnail-service is not involved in that workflow. The thumbnails it creates are different.

Never mind. I was confused. There are similar error messages in thumbnail-generator.

This is the correct repo. But since we're decommissioning the service, I don't think there's anything to do on this.

andrewjbtw commented 4 years ago

@jmartin-sul I don't seem to be able to close issues in this repo. Could you close now that was-thumbnail is off? Thanks!

jmartin-sul commented 4 years ago

> @jmartin-sul I don't seem to be able to close issues in this repo. Could you close now that was-thumbnail is off? Thanks!

weird! closing...

jmartin-sul commented 4 years ago

@andrewjbtw i added you as a maintainer to this repo (hopefully just in time for it to be moved to sul-dlss-deprecated 😄)