clarkkev / deep-coref

270 stars 59 forks source link

ssplit.eolonly rises NullPointerException at edu.stanford.nlp.pipeline.NERCombinerAnnotato #20

Open verosol opened 4 years ago

verosol commented 4 years ago

So, basically we have an already tokenised corpus with golden sentence segmentation, which we want to preserve. Evidently, we found this parameters : tokenize.whitespace = true ssplit.eolonly = true

They work alright together with tokenize,ssplit,pos,lemma and parses, but it we want to pass all the annotators needed for the coreference resolution annotators = tokenize,ssplit,pos,lemma,ner,parse,coref

it gives error Nullpointer exception specifically on NER annotation part.

Processing file /Users/nikahelicopter/Dropbox/data/new_gold/txt/xx00.txt ... writing to /Users/nikahelicopter/Downloads/stanford-corenlp-full-2018-10-05/xx00.txt.xml Exception in thread "main" java.lang.NullPointerException at edu.stanford.nlp.pipeline.NERCombinerAnnotator.annotate(NERCombinerAnnotator.java:322) at edu.stanford.nlp.pipeline.AnnotationPipeline.annotate(AnnotationPipeline.java:76) at edu.stanford.nlp.pipeline.StanfordCoreNLP.annotate(StanfordCoreNLP.java:637) at edu.stanford.nlp.pipeline.StanfordCoreNLP.annotate(StanfordCoreNLP.java:647) at edu.stanford.nlp.pipeline.StanfordCoreNLP.processFiles(StanfordCoreNLP.java:1226) at edu.stanford.nlp.pipeline.StanfordCoreNLP.processFiles(StanfordCoreNLP.java:1060) at edu.stanford.nlp.pipeline.StanfordCoreNLP.run(StanfordCoreNLP.java:1326) at edu.stanford.nlp.pipeline.StanfordCoreNLP.main(StanfordCoreNLP.java:1389)

We use stanford-corenlp-full-2018-10-05 version 3.9.2 An example file: xx01.txt

Parameters: annotators = tokenize,ssplit,pos,lemma,ner,parse,coref tokenize.whitespace = true ssplit.eolonly = true coref.algorithm = neural file = /Users/nikahelicopter/Dropbox/data/new_gold/txt/xx00.txt