Closed GoogleCodeExporter closed 9 years ago
My config:
CrawlConfig config = new CrawlConfig();
config.setCrawlStorageFolder(rootFolder);
config.setMaxPagesToFetch(100000);
config.setPolitenessDelay(1);
config.setUserAgentString(" Mozilla/5.0 (iPad; U; CPU OS 3_2_1 like Mac OS X; en-us) AppleWebKit/531.21.10 (KHTML, like Gecko) Mobile/7B405");
PageFetcher pageFetcher = new PageFetcher(config);
RobotstxtConfig robotstxtConfig = new RobotstxtConfig();
robotstxtConfig.setEnabled(false);
RobotstxtServer robotstxtServer = new RobotstxtServer(robotstxtConfig, pageFetcher);
CrawlController controller = new CrawlController(config, pageFetcher, robotstxtServer);
controller.addSeed("http://www.morhipo.com");
controller.start(Morhipo.class, numberOfCrawlers);
System.out.println("hello");
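Two inputs in the config above are easy to get subtly wrong: the user-agent string starts with a space, and the seed has to be an absolute URL passed as a string. The following is a minimal sketch of a pre-flight check using only the Java standard library. It is not part of crawler4j. The class `CrawlInputCheck` and its helpers `normalizeUserAgent` and `isValidSeed` are hypothetical names introduced here for illustration, and nothing in the thread confirms that either input caused the reported problem.

```java
import java.net.URI;
import java.net.URISyntaxException;

public class CrawlInputCheck {

    // Hypothetical helper: removes stray leading/trailing whitespace
    // from a User-Agent value before it is handed to the crawler config.
    static String normalizeUserAgent(String userAgent) {
        return userAgent.trim();
    }

    // Hypothetical helper: returns true only if the seed parses as an
    // absolute http or https URL that has a host component.
    static boolean isValidSeed(String seed) {
        try {
            URI uri = new URI(seed);
            String scheme = uri.getScheme();
            return uri.getHost() != null
                    && scheme != null
                    && (scheme.equalsIgnoreCase("http") || scheme.equalsIgnoreCase("https"));
        } catch (URISyntaxException e) {
            // Covers malformed input, e.g. a seed containing spaces.
            return false;
        }
    }

    public static void main(String[] args) {
        String userAgent = normalizeUserAgent(
                " Mozilla/5.0 (iPad; U; CPU OS 3_2_1 like Mac OS X; en-us) "
                + "AppleWebKit/531.21.10 (KHTML, like Gecko) Mobile/7B405");
        String seed = "http://www.morhipo.com";

        System.out.println("User agent: [" + userAgent + "]");
        System.out.println("Seed valid: " + isValidSeed(seed));
    }
}
```

Under these assumptions, the cleaned values could then be passed to `config.setUserAgentString(...)` and `controller.addSeed(...)` as in the snippet above.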
Original comment by muhammet...@gmail.com
on 10 Mar 2014 at 12:23
Original comment by avrah...@gmail.com
on 18 Aug 2014 at 3:50
Tested it with the basic crawler example.
It works!
Everything gets crawled (except for several pages that return a server 500 error).
Please try again with the latest build from trunk and report back.
Original comment by avrah...@gmail.com
on 20 Aug 2014 at 12:37
Closed due to inactivity and the lack of a reproducible scenario.
Original comment by avrah...@gmail.com
on 23 Sep 2014 at 2:13
Original issue reported on code.google.com by
muhammet...@gmail.com
on 10 Mar 2014 at 12:22