S1M0N38 / soccerapi

soccerapi, an unambitious soccer odds scraper ⚽️
MIT License
154 stars 37 forks source link

Problem scraping Bet365 with headless mode #38

Open victornoleto opened 2 years ago

victornoleto commented 2 years ago

I know that my question is not exactly related to the repository, but I thought it was worth sharing.

I created a thread on stackoverflow and there extremely detailed what my problem is when web scraping the Bet365 site.

Thanks if you can take a look.

https://stackoverflow.com/questions/70883112/problem-scraping-bet365-with-headless-mode-python-selenium-puppeteer

S1M0N38 commented 2 years ago

Have you already tried soccerapi-server ?

victornoleto commented 2 years ago

Isn't this server just for generating a new X-Net-Sync-Term?

S1M0N38 commented 2 years ago

Yes, but you can use the same technology (puppeteer noheadless inside docker container with virtual display) to achieve your goals. Moreover the stealth plugin improve anti-bot detection

Colombo97 commented 2 years ago

@victornoleto Did u solve the issue?

HMaker commented 2 years ago

In last changes bet365 tries to detect headless chrome but puppeteer-stealth is able to bypass their naive detection.

Colombo97 commented 2 years ago

@HMaker Thank you for feedback, i'll try it

victornoleto commented 2 years ago

Unfortunately I gave up on finding a solution. I didn't get to test the solution proposed by @HMaker.

Colombo97 commented 2 years ago

Lets dicuss about it add me on discord: Colombo97#9666