Closed tungdx closed 8 years ago
What is it you want to correct? What is your issue/problem? That TODO is not an issue. :-) It is merely questioning whether an extra event should be fired for those having registered event listeners with the crawler.
If you suspect you have documents that were rejected for invalid reasons, check the logs for the exact cause. You can also change the log level to DEBUG in the log4j.properties files on the rejections you are interested it to (sometimes) get more information. E.g.:
log4j.logger.CrawlerEvent.REJECTED_FILTER=DEBUG
Sorry for this question. I will debug it more carefully. Thanks.
In AbstractCrawler.java class, my crawler ran into processNextQueuedCrawlData() method and reached to a case has your TODO message "Fire an event here? If we get here, the importer did not kick in". It's happened with just some websites, some others worked well.
Here is my config for crawlers:
How can I correct it? Thanks in advance!