dkpro / dkpro-tc

UIMA-based text classification framework built on top of DKPro Core and DKPro Lab.
https://dkpro.github.io/dkpro-tc/
Other
34 stars 19 forks source link

Add begin/end of annotation-sequence markers for character ngrams calculation #219

Closed daxenberger closed 9 years ago

daxenberger commented 9 years ago

Originally reported on Google Code with ID 219

The method call of 
NGramUtils.getAnnotationCharacterNgrams()
should create character ngrams and mark the beginning/end of the span that is split
(e.g. marking what occurs as prefix in a sequence what as suffix or infix (no-marking)

Reported by Tobias.Horsmann on 2014-12-04 16:38:55

daxenberger commented 9 years ago

Reported by Tobias.Horsmann on 2014-12-04 16:39:29

daxenberger commented 9 years ago

Reported by daxenberger.j on 2015-04-01 17:10:01