mirrorweb / pywb

Core Python Web Archiving Toolkit for replay and recording of web archives
https://pypi.python.org/pypi/pywb
GNU General Public License v3.0
1 stars 2 forks source link

Inconsistent behaviour in the archived site. #68

Open gsinghg19 opened 1 year ago

gsinghg19 commented 1 year ago

Describe the bug

When testing in a local pywb environment, the site is shown to work consistently, with the help of the developed content scripts. But inconsistent behaviour is shown when a crawl is done accessed. Sometimes the instance will show a blank screen, then when refreshed or the link is re-opened again, the site loads up. Links that were previously working may then decide not to work. Console.log's from my code are present in the dev tools, when locally running the site in pywb. But when looking for the same console.logs in the new marrionette instances, they are not present. Older instances show no effect and seem to be working, as expected. To note, only the most recent crawls seem to be affected. Possibly a cdx/indexing issue?

Steps to reproduce the bug

Access the most recent instance in marionette:

  1. Open the archive to the latest instance
  2. Navigate to News and Statements, via the News and Campaigns tab in the main navbar
  3. Access any pages on the page.
  4. Some pages will appear blank but accessible when refreshed. Others will load in as expected.

Expected behaviour

When a link is clicked the corresponding page is open.

Screenshots

Screenshot 2023-01-20 at 10 59 29 (2) Screenshot 2023-01-20 at 10 59 32 (2)

Environment

Additional context

Clients Profile Link: https://app.mirrorweb.com/management/web/16cac3a5-4135-4750-b8fe-657a3faa7a7e/profile/aec04c0d-4c98-4eb1-a491-2e10f306c99c/ Jira ticket: https://mirrorweb.atlassian.net/browse/CS-1326