Closed nvanderperren closed 4 years ago
Thanks for including the detailed info from the WARC. I think the main issue is that the resulting urls in the WARCs are invalid as they are missing the scheme (http:// or https://)
I've added an issue to automatically detect/fix this in warcit. For now, you can rerun warcit with for example:
warcit https://www.koophandeltongeren.be/ ././www.koophandeltongeren.be/
instead of
warcit www.koophandeltongeren.be/ ././www.koophandeltongeren.be/
I think that might fix it
oh! 🤦 that solved the problem indeed! Thanks for your clear answer!
I used
warcit
to create a webarchive of a website which was created with HTTrack years ago. When I load it into the Webrecorder Player I get this page and no pages are shown.An extract of the WARC-file:
I've also tried it with a download of a facebook page which gave the same result.