BayanGroup / nutch-custom-search


NPE trying to index #15

Closed dmnt3rr0r closed 9 years ago

dmnt3rr0r commented 9 years ago

I am trying to use the extractor as an HTML/index filter, but I am getting an NPE when it tries to load the config file, even though I have an extractors.xml file in the conf directory. Here is the exact error:

java.lang.NullPointerException
    at ir.co.bayan.simorq.zal.extractor.nutch.ExtractorIndexingFilter.filter(ExtractorIndexingFilter.java:71)
    at org.apache.nutch.indexer.IndexingFilters.filter(IndexingFilters.java:50)
    at org.apache.nutch.indexer.IndexerMapReduce.reduce(IndexerMapReduce.java:291)
    at org.apache.nutch.indexer.IndexerMapReduce.reduce(IndexerMapReduce.java:53)
    at org.apache.hadoop.mapred.ReduceTask.runOldReducer(ReduceTask.java:522)
    at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:421)
    at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:398)

Any thoughts about what is going on?

tahagh commented 9 years ago

Something has prevented your config file from being loaded. My first guess is that there are syntax errors in your extractors.xml file. Is there any error reported in your Nutch log file about this?
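One quick way to rule out a syntax error before digging through the logs is to run the file through a plain XML parser. The following is a minimal sketch, not part of the extractor plugin: the class name `ExtractorsXmlCheck`, the `check` helper, and the default path `conf/extractors.xml` are all hypothetical, and it only tests well-formedness, so any further validation the plugin applies to the config would still have to be confirmed from the Nutch log.

```java
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Paths;
import javax.xml.parsers.DocumentBuilderFactory;
import org.xml.sax.SAXParseException;

// Hypothetical stand-alone helper; not part of nutch-custom-search.
public class ExtractorsXmlCheck {

    /**
     * Parses the stream with the JDK's built-in XML parser.
     * Returns null if the XML is well-formed, otherwise a short
     * description of the first problem found.
     */
    public static String check(InputStream in) {
        try {
            DocumentBuilderFactory.newInstance().newDocumentBuilder().parse(in);
            return null;
        } catch (SAXParseException e) {
            // Syntax errors carry a position, which is what we want to see.
            return "line " + e.getLineNumber() + ", column " + e.getColumnNumber()
                    + ": " + e.getMessage();
        } catch (Exception e) {
            // I/O problems, parser configuration problems, and so on.
            return e.toString();
        }
    }

    public static void main(String[] args) throws Exception {
        // Assumed default location; pass the real path as the first argument.
        String path = args.length > 0 ? args[0] : "conf/extractors.xml";
        try (InputStream in = Files.newInputStream(Paths.get(path))) {
            String problem = check(in);
            System.out.println(problem == null
                    ? path + ": well-formed"
                    : path + ": " + problem);
        }
    }
}
```

If this reports the file as well-formed, the problem is more likely in the content of the config than in its XML syntax, and the error a few lines above the NPE in the Nutch log is the place to look.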

dmnt3rr0r commented 9 years ago

Sorry, I didn't bother looking up 20 lines in the log. Thanks!