momer / nutch-selenium

Apache License 2.0
28 stars 20 forks source link

another question about https #6

Open slylockfox opened 7 years ago

slylockfox commented 7 years ago

I'd like to basically re-open issue #4 and ask for more detailed guidance on crawling https. Simply enabling protocol-httpclient in nutch-site.xml seems to bypass this plugin

amolshelar2002 commented 7 years ago

I am facing same issue, it is bypassing the selenium whenever I try to add http or httpclient plugin.

sashiIBM commented 6 years ago

Hi is there a solution to this issue? can the selenium plugin handle https?

hussein-alahmad commented 6 years ago

I created a pull-request to fix this issue you can see it here