sul-dlss / was-pywb

Configuration for Stanford's pywb instance
https://swap.stanford.edu

Replay of page with javascript never completes #109

Open lwrubel opened 1 year ago

lwrubel commented 1 year ago

pywb never finishes loading pages from https://swap.stanford.edu/was/*/http://bondholder-information.stanford.edu/index.html. The console shows the page continually trying to load various JavaScript files.

The page renders in OpenWayback, likely because OpenWayback is not handling the JavaScript.

It's unclear so far what is causing the problem with replay. This ticket is to describe what's known about the pywb bug so far. It prevents viewing the site and capturing a thumbnail for its seed.

lwrubel commented 1 year ago

@peterchanws will run a capture with webrecorder and accession that crawl.

peterchanws commented 1 year ago

Autopilot didn't run properly at https://bondholder-information.stanford.edu/. I checked the archived pages in AIT and found the site was not captured properly: https://wayback.archive-it.org/5591/20220429054108/https://bondholder-information.stanford.edu/ I tried patching several times, but some images are still not aligning properly. I have reported the issue to AIT.

peterchanws commented 1 year ago

I archived the site using Browsertrix Crawler, accessioned it using one-time registration, and manually created a thumbnail. Here are the results: https://argo.stanford.edu/view/druid:jn493fq7015 https://swap.stanford.edu/was/20220729152735/https://bondholder-information.stanford.edu/

edsu commented 1 year ago

I'm noticing that the page displays (yay), but there appears to be some JavaScript code injected into the page during replay?! Scan the replayed page for:

However, _____WB$wombat$check$this$function_____(this) Section
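
For anyone who wants to check other captures for the same symptom, here is a rough sketch of how the scan could be automated. It only looks for the marker string quoted above in a page's HTML; the function name find_wombat_leaks and the context parameter are made up for illustration and are not part of pywb.

```python
import re

# Marker copied from the replayed page text quoted above.
WOMBAT_THIS = "_____WB$wombat$check$this$function_____(this)"


def find_wombat_leaks(html: str, context: int = 20) -> list[str]:
    """Return each occurrence of the marker with `context` characters on either side.

    Hypothetical helper for illustration; it does plain substring matching only.
    """
    hits = []
    for match in re.finditer(re.escape(WOMBAT_THIS), html):
        start = max(match.start() - context, 0)
        end = min(match.end() + context, len(html))
        hits.append(html[start:end])
    return hits


if __name__ == "__main__":
    sample = "<p>However, " + WOMBAT_THIS + " Section</p>"
    for hit in find_wombat_leaks(sample, context=9):
        print(hit)
```

The html argument would be the body of the replayed page, however it was fetched. An empty list means the marker was not found in that HTML; it does not rule out other replay problems.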