The DBpedia Spotlight container applies a list of stop words before running the actual disambiguation. If a CWB corpus is the point of departure and a named entity starts or ends with a stop word, the result returned may therefore deviate from the input.
Solution: reconstruct the original region from the corpus positions.
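
The idea can be illustrated with a minimal Python sketch. This is not the code of the actual fix: the token list, the helper names `region_text()` and `reconstruct()`, and the example output of the service are all hypothetical. The assumption is that each annotated region is known by its left and right corpus position, so the surface form can be read from the corpus instead of being taken from the (possibly trimmed) text the service returns.

```python
from typing import Dict, List, Tuple, Union

# Hypothetical tokenised corpus: the list index stands in for the corpus position (cpos).
TOKENS: List[str] = [
    "The", "Federal", "Republic", "of", "Germany", "and",
    "the", "United", "Nations", "met", ".",
]


def region_text(tokens: List[str], region: Tuple[int, int]) -> str:
    """Join the tokens of a region given as (left cpos, right cpos), both inclusive."""
    left, right = region
    return " ".join(tokens[left:right + 1])


def reconstruct(
    tokens: List[str], region: Tuple[int, int], returned: str
) -> Dict[str, Union[int, str, bool]]:
    """Restore the original surface form from corpus positions.

    `returned` is the surface form reported by the service, which may lack a
    leading or trailing stop word.
    """
    original = region_text(tokens, region)
    return {
        "cpos_left": region[0],
        "cpos_right": region[1],
        "original": original,
        "returned": returned,
        "deviates": original != returned,
    }


# Assumed case: the region starts with the stop word "the", which the service dropped.
result = reconstruct(TOKENS, (6, 8), "United Nations")
print(result["original"], result["deviates"])
```

Because the corpus positions stay attached to the result, the annotation can be mapped back to the corpus even when the returned surface form is shorter than the region.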
Fixed. Note: The latest version of polmineR also addresses this issue, because as.AnnotatedPlainTextDocument() takes in stop words, so reconstructing NE regions may no longer be necessary.