AtuboDad / playwright_stealth

playwright stealth
MIT License

Datadome scraping #5

Closed: goldenking0412 closed this issue 3 years ago

goldenking0412 commented 3 years ago

I noticed that the major features live in JS files, so I implemented this using the playwright-ruby-client gem. It worked at first, but now the website detects the scraper and no longer lets me scrape it.

The website is https://en.oui.sncf

Thank you

goldenking0412 commented 3 years ago

When I visit the page below using playwright stealth, it shows a CAPTCHA: https://antoinevastel.com/bots/datadome
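For anyone trying to reproduce this, here is a minimal sketch using the Python playwright_stealth package and its stealth_sync helper. The helper name looks_like_captcha and the marker string "captcha-delivery.com" are assumptions made for illustration (the marker is a guess at what a DataDome challenge page contains), so adjust them to whatever the page actually serves.

```python
def looks_like_captcha(html: str, markers=("captcha-delivery.com",)) -> bool:
    """Return True if any marker substring appears in the page HTML.

    The default marker is an assumption, not a documented DataDome contract.
    """
    lowered = html.lower()
    return any(marker in lowered for marker in markers)


def check_page(url: str) -> bool:
    """Load `url` with the stealth patches applied and report a suspected CAPTCHA."""
    # Imported here so looks_like_captcha can be used without Playwright installed.
    from playwright.sync_api import sync_playwright
    from playwright_stealth import stealth_sync

    with sync_playwright() as p:
        browser = p.chromium.launch(headless=True)
        page = browser.new_page()
        stealth_sync(page)  # apply the stealth patches before navigating
        page.goto(url)
        html = page.content()
        browser.close()
    return looks_like_captcha(html)
```

Calling check_page("https://antoinevastel.com/bots/datadome") should return True when the challenge is served, provided the marker assumption holds.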

goldenking0412 commented 3 years ago

I am running a Playwright scraper on Windows, with about 5 scraping requests at once. I ran npm install playwright to install Playwright.

It works at first, but after a few minutes or a few requests I get this error: Resource temporarily unavailable - ./node_modules/.bin/playwright run-driver

Can you please check what's wrong with this?
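One possible cause, not confirmed in this thread, is that each concurrent request starts its own driver process and the system runs out of resources. The sketch below shows one way to cap how many scrapes run at once with an asyncio semaphore. The scrape function is a hypothetical stand-in for the real Playwright call, and the limit of 2 is an arbitrary assumption to tune.

```python
import asyncio

MAX_CONCURRENT = 2  # assumption: tune this for the machine


async def scrape(url: str) -> str:
    """Hypothetical stand-in for the real Playwright scraping call."""
    await asyncio.sleep(0.01)  # simulate work
    return f"scraped {url}"


async def scrape_all(urls, limit=MAX_CONCURRENT):
    """Scrape every URL, running at most `limit` scrapes at the same time."""
    semaphore = asyncio.Semaphore(limit)

    async def bounded(url):
        # Tasks wait here until one of the `limit` slots is free.
        async with semaphore:
            return await scrape(url)

    # gather returns results in the same order as the input URLs.
    return await asyncio.gather(*(bounded(u) for u in urls))
```

Running asyncio.run(scrape_all(list_of_urls)) processes the whole list while keeping the number of simultaneous scrapes bounded. Sharing a single Playwright driver across requests, rather than launching one per request, may also help if process creation is the bottleneck.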