paulirish / headless-cat-n-mouse

Is headless chrome currently detectable? Let's pit the detections and detection evasions against eachother.
Apache License 2.0
640 stars 56 forks source link

Bloomberg detects you #22

Open Kikobeats opened 5 years ago

Kikobeats commented 5 years ago
screenshot 2019-01-27 at 00 23 50

Example URL

https://www.bloomberg.com/news/articles/2019-01-15/here-are-five-volatility-charts-keeping-wall-street-up-at-night

When you enter performing the request using puppeteer the website automatically redirects you to Terms of Service forcing you to verify you are a human with captcha checkbox.

NikolaiT commented 5 years ago

does somebody have an idea how to prevent this?

does bloomberg track mouse movement?

franz101 commented 5 years ago

@NikolaiT From a short inspection they use perimeterx for bot detection.

I was able to fetch the document with the following header: let url = 'https://www.bloomberg.com/news/articles/2019-01-15/here-are-five-volatility-charts-keeping-wall-street-up-at-night' var headers = { 'Accept': 'text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8', 'User-Agent': 'Mozilla/5.0 (Macintosh; Intel Mac OS X 10_14_3) AppleWebKit/605.1.15 (KHTML, like Gecko) Version/12.0.3 Safari/605.1.15', 'Accept-Language': 'en-us', 'Accept-Encoding': 'br, gzip, deflate', 'Connection': 'keep-alive' } let data = await fetch(url,{ method: 'GET', headers: headers}) let body = await data.text()

As you can see it works by simply changing the user agent to: Mozilla/5.0 (Macintosh; Intel Mac OS X 10_14_3) AppleWebKit/605.1.15 (KHTML, like Gecko) Version/12.0.3 Safari/605.1.15

@paulirish let me know if that works.

Kikobeats commented 5 years ago

I tested your code snippet and it doesn't work to me 🤔

momala454 commented 4 years ago

it detect the IP