palewire / archiveis

A simple Python wrapper for the archive.is capturing service
https://archive.is
MIT License

Bypass cloudflare anti-bot page #4

Closed CaioWzy closed 6 years ago

CaioWzy commented 7 years ago

This is what I get when trying to capture a page from an Amazon EC2 instance. I routed my web traffic through a SOCKS tunnel over SSH, tried to access archive.is, and got something like this. Because the requests library can't solve Cloudflare's JavaScript challenge, I added cfscrape as an alternative. It works the same way as requests but can solve those JS challenges, with Node.js installed alongside it.
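For illustration, here is a minimal sketch of the idea, not the exact code in this pull request. It assumes cfscrape is optional and falls back to a plain requests session when it isn't installed. The helper name `build_session` and the `proxy_url` parameter are hypothetical, and routing through a SOCKS proxy with requests needs the optional `requests[socks]` extra.

```python
def build_session(proxy_url=None):
    """Return a requests-compatible session (hypothetical helper).

    Prefers cfscrape, whose create_scraper() returns a requests.Session
    subclass that tries to solve Cloudflare's JavaScript challenge
    (it needs Node.js available on the machine). Falls back to plain
    requests when cfscrape is not installed.
    """
    try:
        import cfscrape
        session = cfscrape.create_scraper()
    except ImportError:
        import requests
        session = requests.Session()

    if proxy_url:
        # Assumption: the same proxy handles both schemes, e.g. a local
        # SSH SOCKS tunnel such as "socks5h://127.0.0.1:1080".
        session.proxies.update({"http": proxy_url, "https": proxy_url})
    return session
```

Since the returned object behaves like a requests session, a call such as `build_session().get("https://archive.is")` works the same way with either backend.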