webrecorder / pywb

Core Python Web Archiving Toolkit for replay and recording of web archives
https://pypi.python.org/pypi/pywb
GNU General Public License v3.0

XML files not replaying with included XSL #862

Open Jmontgomery045 opened 10 months ago

Jmontgomery045 commented 10 months ago

Describe the bug

When an XML file references an XSL stylesheet in-line, capturing or replaying that file seems to fail.

Steps to reproduce the bug

  1. Capture this URL via Conifer: https://news.onr.org.uk/sitemap_index.xml (this XML references an XSL file in-line)
  2. Replay the WARC in ReplayWeb.page
  3. The replay does not work.
  4. Capture this URL via Conifer: https://www.mirrorweb.com/sitemap.xml (this XML has no in-line XSL reference)
  5. Replay the WARC in ReplayWeb.page
  6. The replay works.
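
To check which of the two cases a given XML file falls into, the following is a minimal sketch. It assumes that "in-line XSL" means an `xml-stylesheet` processing instruction in the XML prolog. The helper name `find_xsl_hrefs`, the sample documents, and the `example.com` URLs are made up for illustration and are not part of pywb or of the sites above.

```python
import re
from xml.dom import minidom

# Hypothetical sample: a sitemap that references an XSL stylesheet.
WITH_XSL = (
    '<?xml version="1.0" encoding="UTF-8"?>'
    '<?xml-stylesheet type="text/xsl" href="https://example.com/main-sitemap.xsl"?>'
    '<sitemapindex xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">'
    '<sitemap><loc>https://example.com/page-sitemap.xml</loc></sitemap>'
    '</sitemapindex>'
)

# Hypothetical sample: a sitemap with no stylesheet reference.
WITHOUT_XSL = (
    '<?xml version="1.0" encoding="UTF-8"?>'
    '<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">'
    '<url><loc>https://example.com/</loc></url>'
    '</urlset>'
)


def find_xsl_hrefs(xml_text):
    """Return the href of every top-level xml-stylesheet processing instruction."""
    doc = minidom.parseString(xml_text)
    hrefs = []
    # Stylesheet processing instructions sit in the prolog, i.e. they are
    # direct children of the document node, before the root element.
    for node in doc.childNodes:
        if (node.nodeType == node.PROCESSING_INSTRUCTION_NODE
                and node.target == "xml-stylesheet"):
            # node.data holds pseudo-attributes such as: type="text/xsl" href="..."
            match = re.search(r'href\s*=\s*["\']([^"\']+)["\']', node.data)
            if match:
                hrefs.append(match.group(1))
    return hrefs


if __name__ == "__main__":
    print(find_xsl_hrefs(WITH_XSL))
    print(find_xsl_hrefs(WITHOUT_XSL))
```

If the function returns a non-empty list for a captured file, the referenced stylesheet URLs are the ones worth checking for in the WARC, since the browser needs to fetch them to render the XML.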

Expected behavior

XML files with in-line XSL styling should replay as normal.