xrma / crawler4j

Automatically exported from code.google.com/p/crawler4j

Unable to crawl HTTPS URLs using crawler4j #167

GoogleCodeExporter closed this issue 9 years ago

GoogleCodeExporter commented 9 years ago
What steps will reproduce the problem?
1. Use any HTTPS-based URL.

What is the expected output? What do you see instead?
I want to get the page source.

What version of the product are you using?
Latest version

Please provide any additional information below.
I get a Fatal transport error.
ERROR [Crawler 1] Fatal transport error: null while fetching https://...

Please help. It's really important.
Thanks in advance.

Original issue reported on code.google.com by arpita.i...@gmail.com on 17 Jul 2012 at 12:53

GoogleCodeExporter commented 9 years ago
Same problem for me. Using crawler4j v3.4, JDK 1.6, Mac OS X 10.8.4.

Original comment by apenneba...@42six.com on 9 Sep 2013 at 6:22

GoogleCodeExporter commented 9 years ago
CrawlConfig config = new CrawlConfig();
config.setCrawlStorageFolder(crawlStorageFolder);
config.setIncludeHttpsPages(true); // This enables HTTPS

Original comment by avrah...@gmail.com on 11 Aug 2014 at 11:15
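To check whether the fix in the last comment applies to a given crawl, it can help to see which seed URLs actually use the https scheme. The following is a minimal sketch using only the Java standard library; `SeedSchemeCheck`, `isHttps`, `httpsSeeds`, and the example.com/example.org seeds are hypothetical names for illustration and are not part of crawler4j. It assumes, as that comment suggests, that `config.setIncludeHttpsPages(true)` is what makes crawler4j 3.x fetch such URLs.

```java
import java.net.URI;
import java.net.URISyntaxException;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper for illustration; not part of crawler4j.
final class SeedSchemeCheck {

    // Returns true when the URL's scheme is "https" (case-insensitive).
    static boolean isHttps(String url) {
        try {
            String scheme = new URI(url.trim()).getScheme();
            return "https".equalsIgnoreCase(scheme);
        } catch (URISyntaxException e) {
            return false; // malformed URLs are treated as non-HTTPS here
        }
    }

    // Collects the seeds that would need HTTPS support enabled in the crawler.
    static List<String> httpsSeeds(List<String> seeds) {
        List<String> result = new ArrayList<>();
        for (String seed : seeds) {
            if (isHttps(seed)) {
                result.add(seed);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        // Placeholder seeds; replace with the URLs passed to the crawler.
        List<String> seeds = Arrays.asList(
                "http://example.com/", "https://example.org/index.html");
        List<String> needHttps = httpsSeeds(seeds);
        if (!needHttps.isEmpty()) {
            // Assumption: this is the point where config.setIncludeHttpsPages(true)
            // from the comment above would be called, before the crawl starts.
            System.out.println("HTTPS seeds found: " + needHttps);
        }
    }
}
```

If any seed is reported here and the "Fatal transport error" still appears with the flag set, the cause is probably something other than the HTTPS setting.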