berstend / puppeteer-extra

💯 Teach puppeteer new tricks through plugins.
https://extra.community
MIT License
6.23k stars 732 forks

What are all you guys using puppeteer for? #217

Closed Aditya94A closed 3 years ago

Aditya94A commented 4 years ago

I'm just curious to know what everyone is scraping 😉 There isn't much of a community around it, so I thought this would be the best place to discuss.

dantgal commented 4 years ago

Google search results - but it's impossible now using this plugin - Google changed reCAPTCHA and there's no chance of solving it

Aditya94A commented 4 years ago

Interesting, would love to know your use case. How exactly are you using Google search results? In a derivative product or some internal tool or something else?

Assuming you're talking about this issue, they seem to have a workaround.

dench1k commented 4 years ago

I make music mixes every month and post them on a couple of platforms, including YouTube. There is a thing called music policies, where you can search for a track and get info about its license (i.e. whether you can actually use it without restrictions or whether it can cause a ban). So I made a bot which logs into the Gmail account, searches this service for each track in my tracklist, tells me the info about the track and takes a screenshot so I can check that info.

seuaCoder commented 4 years ago

Google search results - but it's impossible now using this plugin - Google changed reCAPTCHA and there's no chance of solving it

Did you find any alternative?

dantgal commented 4 years ago

Nope, and I will be very happy when the reCAPTCHA plugin for puppeteer is upgraded, because the alternatives are very expensive, or maybe a better way to put it: too expensive for me :)

Ross1337 commented 4 years ago

Here is a free solver that works (for me lol): https://github.com/danielgatis/puppeteer-recaptcha-solver

dantgal commented 4 years ago

But that's for the old-style (embedded) reCAPTCHA; for such things puppeteer works like a charm too

Ross1337 commented 4 years ago

You need v3?

dantgal commented 4 years ago

yuup! Tried https://github.com/danielgatis/puppeteer-recaptcha-solver but it also failed: https://postimg.cc/kD24p1Hb - tough times for google serp scrapers :(

lmourabraga commented 4 years ago

I have been using it to scrape all the images from the e-commerce site of the company I work for and save each file named with the SKU of its product, in order to centralize all these images in an API we created for our products.
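For anyone wanting to try something similar, here is a rough sketch of just the file-naming step. It is not the code used above: the helper name `imageFileName`, the `_2`, `_3` suffix for extra images of the same product, the `.jpg` fallback and the example URLs are all assumptions made for illustration.

```javascript
// Hypothetical helper (not from this thread): builds a file name such as
// "ABC-123.jpg" or "ABC-123_2.png" from a product SKU and an image URL.
function imageFileName(sku, imageUrl, index = 0) {
  // Take the extension from the URL path, ignoring any query string.
  const pathname = new URL(imageUrl).pathname;
  const match = /\.(jpe?g|png|gif|webp|avif)$/i.exec(pathname);
  // Assumption: fall back to "jpg" when the URL has no recognizable extension.
  const ext = match ? match[1].toLowerCase() : 'jpg';

  // Replace runs of characters that are awkward in file names with "_".
  const safeSku = String(sku).trim().replace(/[^A-Za-z0-9._-]+/g, '_');

  // The first image keeps the bare SKU; later ones get _2, _3, ...
  const suffix = index > 0 ? `_${index + 1}` : '';
  return `${safeSku}${suffix}.${ext}`;
}

console.log(imageFileName('ABC-123', 'https://shop.example/img/front.jpg'));
console.log(imageFileName('ABC-123', 'https://shop.example/img/back.PNG?w=800', 1));
```

In a puppeteer script the image URLs could come from something like `page.$$eval('img', imgs => imgs.map(img => img.src))`; which selector actually matches the product images depends entirely on the site.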

lmourabraga commented 4 years ago

yuup! Tried https://github.com/danielgatis/puppeteer-recaptcha-solver but it also failed: https://postimg.cc/kD24p1Hb - tough times for google serp scrapers :(

Dantgal, regarding your situation: there are no known solutions for reCAPTCHA v3 yet, since it was designed not to be beaten by automation. In practice you will always need a human solver for it. As you can see, the solutions available on the internet depend on human hands to deliver their results. For instance: https://2captcha.com/?from=9148648 (see the section "Work for 2Captcha").

dantgal commented 4 years ago

Yes, you are absolutely right! I used 2captcha with puppeteer for as long as it was possible - now it's not :(

radoslavkarlik commented 4 years ago

Browser game bot. More reliable than simple HTTP requests :D Just a fun personal project.

cnscorpions commented 3 years ago

I scrape Google SERPs with Puppeteer, but now Google bans my bot. :(

cnscorpions commented 3 years ago

@dantgal Hi, I'm just wondering how to identify the reCAPTCHA version? I'm totally confused.
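One rough way to tell the versions apart is to look at how the page loads the reCAPTCHA script. As far as I know, v3 loads `api.js` with `render=<site key>`, while v2 uses a `g-recaptcha` element (with `data-size="invisible"` for the invisible variant) and at most `render=explicit`. The sketch below is only a heuristic over the raw HTML: the function name `guessRecaptchaVersion` and the returned labels are made up for illustration, and pages that inject the widget from their own JavaScript will come back as `unknown`.

```javascript
// Heuristic sketch (an assumption, not an official detection method):
// guesses the reCAPTCHA flavour from a page's raw HTML.
function guessRecaptchaVersion(html) {
  // v3 pages load api.js with render=<site key>;
  // v2 pages use render=explicit, render=onload or no render parameter.
  const render = /recaptcha\/(?:api|enterprise)\.js\?[^"'\s>]*\brender=([^&"'\s>]+)/i.exec(html);
  if (render && render[1] !== 'explicit' && render[1] !== 'onload') {
    return 'v3';
  }

  // v2 widgets are declared on an element with the g-recaptcha class.
  const hasWidget = /class=["'][^"']*\bg-recaptcha\b/i.test(html);
  if (hasWidget) {
    // The invisible v2 variant is marked with data-size="invisible".
    return /data-size=["']invisible["']/i.test(html) ? 'v2-invisible' : 'v2-checkbox';
  }

  return 'unknown';
}

const sample = '<script src="https://www.google.com/recaptcha/api.js?render=6LcEXAMPLE"></script>';
console.log(guessRecaptchaVersion(sample));
```

With puppeteer, the HTML could be taken from `await page.content()` after the page has loaded.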

itsdarrylnorris commented 3 years ago

@AdityaAnand1

I'm just curious to know what everyone is scraping 😉 There isn't much of a community around it, so I thought this would be the best place to discuss.

We have a discord server with a community around scraping/puppeteer/playwright. Here is the discord invite -> https://discord.gg/vz7PeKk

cnscorpions commented 3 years ago

@itsdarrylnorris thanks

dantgal commented 3 years ago

@dantgal Hi, I'm just wondering how to identify the reCAPTCHA version? I'm totally confused.

Hello Gavin! Sorry, but I gave up on puppeteer, too many sleepless nights fighting with it :(

cnscorpions commented 3 years ago

@dantgal Okay, good luck. It's hard work for me too. I've been working on a Google scraping project overnight. :( Oh shit, Google is banning all my proxy IPs.

hugh-bowie commented 3 years ago

@AdityaAnand1 - onlyFans automation for a growing 'influencer'. Those girls make serious cash

berstend commented 3 years ago

Closing this for now 😄

There isn't much of a community

Not anymore! Our discord has an active and cool community (which is better suited for these types of discussion): https://github.com/berstend/puppeteer-extra/wiki/Scraping-Chat