hartator / wayback-machine-downloader

Download an entire website from the Wayback Machine.

Decompress gzip content #262

Open Forage opened 1 year ago

Forage commented 1 year ago

The Wayback Machine (WBM) applies compression to its responses. Sending "gzip;q=0, identity" as the Accept-Encoding header is ignored, so this PR decompresses the retrieved content instead. Fixes #259.
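For readers following along, here is a minimal sketch of what decompressing retrieved content can look like in Ruby. It is not the code from this PR: `gunzip_if_needed` is a hypothetical helper, and detecting gzip by its two magic bytes (rather than by the Content-Encoding response header) is an assumption made for illustration.

```ruby
require "zlib"
require "stringio"

# Every gzip stream starts with these two bytes.
GZIP_MAGIC = "\x1f\x8b".b

# Hypothetical helper (not part of the PR): returns the decompressed body
# when it looks like gzip data, otherwise returns the body unchanged.
def gunzip_if_needed(body)
  bytes = body.b # binary copy, so the prefix check is byte-wise
  return body unless bytes.start_with?(GZIP_MAGIC)

  Zlib::GzipReader.new(StringIO.new(bytes)).read
rescue Zlib::Error
  # Not valid gzip after all: fall back to the raw body.
  body
end

# Usage: a gzipped page is restored, a plain page passes through untouched.
page = "<html>archived page</html>"
restored = gunzip_if_needed(Zlib.gzip(page))
plain    = gunzip_if_needed(page)
```

Checking the body itself is useful when the server compresses despite being asked not to, which is the situation described above.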

grandpa1946 commented 1 year ago

when using this, file urls won't be decoded...

Forage commented 1 year ago

@grandpa1946 do you have an example?

grandpa1946 commented 1 year ago

> @grandpa1946 do you have an example?

sorry, i think it has to do with the windows naming scheme...

grandpa1946 commented 1 year ago

it's kinda annoying because urls will be double encoded, but i can't do anything about that of course

Forage commented 1 year ago

So there is or isn't an issue? If there is, please provide an example because it's working nicely for me.

grandpa1946 commented 1 year ago

> So there is or isn't an issue? If there is, please provide an example because it's working nicely for me.

i don't think there is, because i think it gives the same results as the previous version. sorry for making you wait.

VioletFlare commented 11 months ago

Hello, I just tested this wonderful patch and I confirm that it works on Ubuntu. Thank you a thousand times! :))) 👍

Forage commented 10 months ago

This PR would become redundant if https://github.com/hartator/wayback-machine-downloader/issues/267#issuecomment-1868090089 is integrated into the code, as Net::HTTP decompresses automagically.
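A small sketch of the Net::HTTP behavior mentioned above, assuming a standard Ruby installation. When a request is built without an explicit Accept-Encoding header, Net::HTTP adds one itself and marks the request so the response body is decoded transparently. Nothing is sent over the network here; the URL is only used to build the request objects.

```ruby
require "net/http"
require "uri"

# Example URL (an assumption for illustration); no connection is opened.
uri = URI("https://web.archive.org/")

# No Accept-Encoding supplied: Net::HTTP adds its own header and flags
# the request so the response body gets decompressed automatically.
default_request = Net::HTTP::Get.new(uri)
puts default_request["accept-encoding"]
puts default_request.decode_content

# Accept-Encoding supplied by the caller: automatic decoding is switched
# off and the caller has to handle any compression itself.
explicit_request = Net::HTTP::Get.new(uri, "Accept-Encoding" => "gzip;q=0, identity")
puts explicit_request.decode_content
```

This suggests that code setting its own Accept-Encoding header opts out of the automatic decoding, which is one reason manual decompression can still be needed.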