internetarchive / wayback

IA's public Wayback Machine (moved from SourceForge)
750 stars 134 forks source link

302 redirect with relative path location breaks page save #191

Open wesinator opened 5 years ago

wesinator commented 5 years ago

Seems a 302 redirect with relative path locations, e.g. Location: /dest_page.php does not save the destination properly. The save returns an error because wayback tries to resolve the location without adding the parent host to making it absolute.

Unfortunately I can't share an example of this at the moment

Expected: wayback resolves absolute location from original parent host

bartman081523 commented 5 years ago

probably a duplicate of this issue:

a javascript on the original page is embedded like this:

in the archived page, the javascript is embedded from the root of the web.archive.org domain, so it throws a 404 GET http://web.archive.org/static/0.50.4036/js/_crucial-behavior.pkg.js net::ERR_ABORTED 404 (NOT FOUND)

this is the specific example: http://web.archive.org/web/20190404141202/https://www.zdf.de/dokumentation/37-grad/37-rente-unter-palmen-100.html Expected: ait-client-rewrite.js rewrites path locations from original root domain