sayem314 / hooman

http interceptor to hoomanize cloudflare requests
https://www.npmjs.com/package/hooman
MIT License
148 stars 18 forks source link

Improved User Agent emulation #40

Open msjonker opened 3 years ago

msjonker commented 3 years ago

This applies techniques from https://github.com/VeNoMouS/cloudscraper to more closely mimic requests from a browser, so there is a better chance of not being detected as a bot to have to deal with a captcha in the first place. In my testing, these changes created identical requests to cloudscraper, allowing me to access pages that were previously protected with a captcha.

Summary of changes:

For my purposes, I needed to be able to scrape https://bouqs.com/, but without these changes, I was blocked by a challenge. After these changes, requests go through with no challenge. I'm not terribly familiar with the Cloudflare firewall, but I know they compute a "Bot Threat Score" and I think these changes help to lower it enough that, at least in my case, it allows my requests through.