yatish27 / linkedin-scraper

Scrapes the public profile of the linkedin page
MIT License
552 stars 221 forks source link

Still unable to use proxy IP #45

Closed cyberfab007 closed 8 years ago

cyberfab007 commented 9 years ago

Anyway we can get a proxy variable dropped in to this script please , I know how to do it php , it is so easy when using CURL, I mean it must be simper in ruby to do this!

cyberfab007 commented 9 years ago

http://ruby-doc.org/stdlib-2.0/libdoc/open-uri/rdoc/OpenURI/OpenRead.html

somthing like this ?

mlieber commented 9 years ago

Use the set_proxy() method on agent, i.e.: agent.set_proxy('proxy address', port, 'username', 'password')

cyberfab007 commented 9 years ago

Im sorry but I don't know how to code in ruby as of yet (though I have taken some tutorials, can you just update in the github release ? so its like this "linkedin-scraper LINKEDINURL proxy port " when you run the command

yatish27 commented 9 years ago

I guess you can buy proxies for the virtual server in which you running this bot. You can buy them at one place https://instantproxies.com

cyberfab007 commented 9 years ago

yatish27 I have over 500 proxies from not only instantproxies.com but others as well I use them for my lead generation for my business , but I can not use them with this scraper , I am learning ruby , but cant figure it out how to drop the proxy variable ,

you should be able to do this from the command line

linkedin-scaper PROFILEURL 10.40.43.35:8080

that would ideal

yatish27 commented 9 years ago

Let me look into it

On Mon, May 11, 2015 at 8:23 PM, Jeremy Fabiano notifications@github.com wrote:

yatish27 I have over 500 proxies from only instantproxies.com but others as well I use them for my lead generation for my business , but I can not use them with this scraper , I am learning ruby , but cant figure it out how to drop the proxy variable , you should be able to do this from the command line linkedin-scaper PROFILEURL 10.40.43.35:8080

that would ideal

Reply to this email directly or view it on GitHub: https://github.com/yatish27/linkedin-scraper/issues/45#issuecomment-101080358

mlieber commented 9 years ago

No worries I know how; I will try your proxy.

Le lundi 11 mai 2015, Yatish Mehta notifications@github.com a écrit :

Let me look into it

On Mon, May 11, 2015 at 8:23 PM, Jeremy Fabiano <notifications@github.com javascript:_e(%7B%7D,'cvml','notifications@github.com');> wrote:

yatish27 I have over 500 proxies from only instantproxies.com but others as well I use them for my lead generation for my business , but I can not use them with this scraper , I am learning ruby , but cant figure it out how to drop the proxy variable , you should be able to do this from the command line linkedin-scaper PROFILEURL 10.40.43.35:8080

that would ideal

Reply to this email directly or view it on GitHub:

https://github.com/yatish27/linkedin-scraper/issues/45#issuecomment-101080358

— Reply to this email directly or view it on GitHub https://github.com/yatish27/linkedin-scraper/issues/45#issuecomment-101082186 .

cyberfab007 commented 9 years ago

well thats not a real proxy I use here I just make that one up

cyberfab007 commented 9 years ago

besides my paid proxies , I also use proxies from here http://www.publicproxyservers.com/proxy/list1.html

cyberfab007 commented 9 years ago

here as well http://tools.rosinstrument.com/raw_free_db.htm?0&t=1

mlieber commented 9 years ago

Ok thanks. But we want to use dedicated proxies, not public ones, that's very unsafe, they record your queries. I am already using a private proxy, thanks.

On Mon, May 11, 2015 at 6:14 PM, Jeremy Fabiano notifications@github.com wrote:

here as well http://tools.rosinstrument.com/raw_free_db.htm?0&t=1

— Reply to this email directly or view it on GitHub https://github.com/yatish27/linkedin-scraper/issues/45#issuecomment-101086518 .

cyberfab007 commented 9 years ago

so you updated the script ?

mlieber commented 9 years ago

The proxy that I was using got blocked after a couple of 100 requests.. We need to find another solution.

Le mardi 12 mai 2015, Jeremy Fabiano notifications@github.com a écrit :

so you updated the script ?

— Reply to this email directly or view it on GitHub https://github.com/yatish27/linkedin-scraper/issues/45#issuecomment-101460953 .

cyberfab007 commented 9 years ago

humm, just use the public ones for just testing , I have lots of private ones , but they are being used to pull down the profiles right now and they behavior is set in such a way google is not bothering me right now

mlieber commented 9 years ago

Thanks, but it won't work. Linked in is very sensitive with proxies. I tried a lot of them already.

Le mardi 12 mai 2015, Jeremy Fabiano notifications@github.com a écrit :

humm, just use the public ones for just testing , I have lots of private ones , but they are being used to pull down the profiles right now and they behavior is set in such a way google is not bothering me right now

— Reply to this email directly or view it on GitHub https://github.com/yatish27/linkedin-scraper/issues/45#issuecomment-101484850 .

cyberfab007 commented 9 years ago

ok I will have some fresh proxies for you

cyberfab007 commented 9 years ago

just waiting for some paypal transfer from my bank , I will buy 50 more

mlieber commented 9 years ago

Wow - 50 may work indeed! I just need their IPs and the username and password. Cheers -

Le mardi 12 mai 2015, Jeremy Fabiano notifications@github.com a écrit :

just waiting for some paypal transfer from my bank , I will buy 50 more

— Reply to this email directly or view it on GitHub https://github.com/yatish27/linkedin-scraper/issues/45#issuecomment-101505996 .

cyberfab007 commented 9 years ago

do you have the script working ? with 50 IP's I should be able to pull about 500 profiles a hour

mlieber commented 9 years ago

Yes the script is working. I just need your list of IPs and port, username and pass. Thanks

Le mardi 12 mai 2015, Jeremy Fabiano notifications@github.com a écrit :

do you have the script working ? with 50 IP's I should be able to pull about 500 profiles a hour

— Reply to this email directly or view it on GitHub https://github.com/yatish27/linkedin-scraper/issues/45#issuecomment-101525243 .

cyberfab007 commented 9 years ago

add me on skype

cyberfab007 commented 9 years ago

cyberfab007

mlieber commented 9 years ago

what's your handle ? There are several -

On Wed, May 13, 2015 at 6:06 PM, Jeremy Fabiano notifications@github.com wrote:

add me on skype

— Reply to this email directly or view it on GitHub https://github.com/yatish27/linkedin-scraper/issues/45#issuecomment-101870192 .

cyberfab007 commented 9 years ago

cyberfab007 is my handle

bricemaurin commented 8 years ago

Hi there, any news on the possibility to use proxies ? this version allows that: https://github.com/alex-go/linkedin-scraper/blob/master/lib/linkedin-scraper/profile.rb. Should I fork or may i expect these modifs to be included at some point ? It seems like the proxy topic is coming regularly but it's never been merged into master

yatish27 commented 8 years ago

@deuxio Check out version 1.0.2

bricemaurin commented 8 years ago

Thanks a lot Yatish

yatish27 commented 8 years ago

Check latest version