sul-dlss / was-registrar-app

Rails app to organize downloaded web archiving data and trigger preassembly/accessioning when appropriate

Item not registered properly #494

Closed peterchanws closed 2 years ago

peterchanws commented 2 years ago

https://argo.stanford.edu/view/druid:bq424qf7386

A thumbnail generation error occurred. I re-ran the thumbnail-generator and the error disappeared. However, the thumbnail is not the correct one and there are no links to the archived site. I found the archived site via swap.stanford.edu. (Three screenshots attached, 2022-07-14.)

peterchanws commented 2 years ago

https://argo.stanford.edu/view/druid:by742qp8508

For this one, the thumbnail was generated correctly after the re-run. There are no links to the archived page in the PURL: https://purl.stanford.edu/bq424qf7386 (screenshot attached)

Found in SWAP: https://swap.stanford.edu/was/*?url=https%3A%2F%2Fwww.bakersfieldcity.us%2F890%2FPolicies-and-Procedures (screenshot attached)
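For anyone checking more items from the queue by hand, here is a minimal sketch of how the SWAP and PURL links above can be built from a seed URL and a druid. The function names are made up for illustration, and the URL patterns are only inferred from the links in this comment, not taken from any SWAP or PURL documentation.

```python
from urllib.parse import quote

# Base URLs copied from the links in this thread (assumed stable).
SWAP_BASE = "https://swap.stanford.edu/was"
PURL_BASE = "https://purl.stanford.edu"


def swap_lookup_url(seed_url: str) -> str:
    """Build a SWAP capture-lookup link for a seed URL (hypothetical helper)."""
    # safe="" also percent-encodes ":" and "/", as in the link above.
    return f"{SWAP_BASE}/*?url={quote(seed_url, safe='')}"


def purl_url(druid: str) -> str:
    """Build a PURL link from a druid, with or without the 'druid:' prefix."""
    bare = druid.split(":", 1)[-1]
    return f"{PURL_BASE}/{bare}"


print(swap_lookup_url("https://www.bakersfieldcity.us/890/Policies-and-Procedures"))
print(purl_url("druid:by742qp8508"))
```

Deriving the PURL link from the same druid as the Argo link makes it easier to confirm that both point at the same item.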

peterchanws commented 2 years ago

https://argo.stanford.edu/view/druid:cp247mq3677

The thumbnail regenerated fine. There are no links to the archived page in the PURL: https://purl.stanford.edu/cp247mq3677

lwrubel commented 2 years ago

The first example is a PDF and looks to be experiencing the thumbnail generation problem described in https://github.com/sul-dlss/was_robot_suite/issues/478.

The second item, https://argo.stanford.edu/view/druid:by742qp8508 has a Content Type of "file" rather than "webarchive-seed". Because it does not have a type of webarchive-seed, the sul-embed viewer for webarchive seeds doesn't get used in PURL. It gets treated as a file with a preview. Maybe the menu was set to something else when it was registered? Or do you know more about the history of this one?

I'm looking at the other link included, https://purl.stanford.edu/bq424qf7386, which is a different site (a PDF) and its record also has a content type of file rather than webarchive-seed.
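To illustrate the content type behavior described above, here is a rough sketch. It is not the actual sul-embed or PURL code; the function names and the dictionary of items are hypothetical, and it assumes every item in the list was meant to be a web archive seed.

```python
WEBARCHIVE_SEED = "webarchive-seed"


def purl_rendering(content_type: str) -> str:
    """Describe how PURL would present an item (simplified assumption)."""
    if content_type == WEBARCHIVE_SEED:
        return "webarchive-seed viewer"
    # Anything else is shown like an ordinary file, with a preview.
    return "file preview"


def needs_reregistration(items: dict[str, str]) -> list[str]:
    """Return druids whose content type is not webarchive-seed."""
    return [druid for druid, ctype in items.items() if ctype != WEBARCHIVE_SEED]


# Content types as reported in this comment for the two records examined.
items = {
    "druid:by742qp8508": "file",
    "druid:bq424qf7386": "file",
}

for druid, ctype in items.items():
    print(druid, "->", purl_rendering(ctype))
print(needs_reregistration(items))
```

Both records examined so far come back as candidates for re-registration under that assumption.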

peterchanws commented 2 years ago

Thanks, Laura. All the examples come from the thumbnail generation error queue. They are a mix of items, some accessioned recently and some many years ago. I will re-register them with the correct file type.