Conversion from document annotations to IBO and back:
initially only support for token-aligned annotations, non-overlapping
running ann2ibo sets a feature on each token with the ibo label(s) or sets several features, one for each type
do this in two steps: 1 create a list of labels or label lists or lists of labels which correspond to a list of token annotations 2 - apply the labels from the list(s) to the annotations (optional, if we use this e.g. for training, only the first step is needed)
running ibo2ann takes a list or lists of labels and applies them to a list of annotations
File format conversion: conll to our annotations:
See also: https://lingpipe-blog.com/2009/10/14/coding-chunkers-as-taggers-io-bio-bmewo-and-bmewo/
See also: https://github.com/GateNLP/corpusconversion-conll2003