So, basically we have an already tokenised corpus with golden sentence segmentation, which we want to preserve. Evidently, we found this parameters :
tokenize.whitespace = true
ssplit.eolonly = true
They work alright together with tokenize,ssplit,pos,lemma and parses, but it we want to pass all the annotators needed for the coreference resolution
annotators = tokenize,ssplit,pos,lemma,ner,parse,coref
it gives error Nullpointer exception specifically on NER annotation part.
Processing file /Users/nikahelicopter/Dropbox/data/new_gold/txt/xx00.txt ... writing to /Users/nikahelicopter/Downloads/stanford-corenlp-full-2018-10-05/xx00.txt.xml
Exception in thread "main" java.lang.NullPointerException
at edu.stanford.nlp.pipeline.NERCombinerAnnotator.annotate(NERCombinerAnnotator.java:322)
at edu.stanford.nlp.pipeline.AnnotationPipeline.annotate(AnnotationPipeline.java:76)
at edu.stanford.nlp.pipeline.StanfordCoreNLP.annotate(StanfordCoreNLP.java:637)
at edu.stanford.nlp.pipeline.StanfordCoreNLP.annotate(StanfordCoreNLP.java:647)
at edu.stanford.nlp.pipeline.StanfordCoreNLP.processFiles(StanfordCoreNLP.java:1226)
at edu.stanford.nlp.pipeline.StanfordCoreNLP.processFiles(StanfordCoreNLP.java:1060)
at edu.stanford.nlp.pipeline.StanfordCoreNLP.run(StanfordCoreNLP.java:1326)
at edu.stanford.nlp.pipeline.StanfordCoreNLP.main(StanfordCoreNLP.java:1389)
We use stanford-corenlp-full-2018-10-05 version 3.9.2
An example file:
xx01.txt
So, basically we have an already tokenised corpus with golden sentence segmentation, which we want to preserve. Evidently, we found this parameters : tokenize.whitespace = true ssplit.eolonly = true
They work alright together with tokenize,ssplit,pos,lemma and parses, but it we want to pass all the annotators needed for the coreference resolution annotators = tokenize,ssplit,pos,lemma,ner,parse,coref
it gives error Nullpointer exception specifically on NER annotation part.
Processing file /Users/nikahelicopter/Dropbox/data/new_gold/txt/xx00.txt ... writing to /Users/nikahelicopter/Downloads/stanford-corenlp-full-2018-10-05/xx00.txt.xml Exception in thread "main" java.lang.NullPointerException at edu.stanford.nlp.pipeline.NERCombinerAnnotator.annotate(NERCombinerAnnotator.java:322) at edu.stanford.nlp.pipeline.AnnotationPipeline.annotate(AnnotationPipeline.java:76) at edu.stanford.nlp.pipeline.StanfordCoreNLP.annotate(StanfordCoreNLP.java:637) at edu.stanford.nlp.pipeline.StanfordCoreNLP.annotate(StanfordCoreNLP.java:647) at edu.stanford.nlp.pipeline.StanfordCoreNLP.processFiles(StanfordCoreNLP.java:1226) at edu.stanford.nlp.pipeline.StanfordCoreNLP.processFiles(StanfordCoreNLP.java:1060) at edu.stanford.nlp.pipeline.StanfordCoreNLP.run(StanfordCoreNLP.java:1326) at edu.stanford.nlp.pipeline.StanfordCoreNLP.main(StanfordCoreNLP.java:1389)
We use stanford-corenlp-full-2018-10-05 version 3.9.2 An example file: xx01.txt
Parameters: annotators = tokenize,ssplit,pos,lemma,ner,parse,coref tokenize.whitespace = true ssplit.eolonly = true coref.algorithm = neural file = /Users/nikahelicopter/Dropbox/data/new_gold/txt/xx00.txt