quadrama / DramaNLP

UIMA NLP components for dramatic texts
Apache License 2.0

Include annotated coreference #40

Closed pagelj closed 6 years ago

pagelj commented 6 years ago

Integrate the .xmi annotations of coreference into the pipeline.

nilsreiter commented 6 years ago

My plan was to export the annotations from XMI to TEI and then read them into the pipeline. The export to TEI has not happened yet, though.

pagelj commented 6 years ago

XMI -> TEI export is now addressed in https://github.com/quadrama/DramaNLP/issues/46

nilsreiter commented 6 years ago

Mentions: <rs ref="">Sara</rs>

nilsreiter commented 6 years ago

Annotated disagreements: <rs ref="e1 e2" func="or">bla</rs>

nilsreiter commented 6 years ago
pagelj commented 6 years ago

Another possible value is func="and", which is the default; omitting func has the same effect.
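
For illustration, here is a minimal sketch of how the rs encoding from this thread could be read back in. It is not DramaNLP code: the function name read_mentions and the sample string are hypothetical, and it assumes un-namespaced elements (real TEI files normally use the TEI namespace, so the tag lookup would need adjusting).

```python
import xml.etree.ElementTree as ET


def read_mentions(xml_text):
    """Collect (surface, entity_ids, func) for every <rs> element.

    Hypothetical helper, assuming un-namespaced <rs> elements.
    """
    root = ET.fromstring(xml_text)
    mentions = []
    for rs in root.iter("rs"):
        # ref holds whitespace-separated entity ids; an empty ref yields no ids
        entity_ids = rs.get("ref", "").split()
        # func is optional; a missing func is treated as the default "and"
        func = rs.get("func", "and")
        surface = "".join(rs.itertext())
        mentions.append((surface, entity_ids, func))
    return mentions


# Made-up sample combining the two encodings discussed in this thread
sample = '<p><rs ref="e1">Sara</rs> and <rs ref="e1 e2" func="or">bla</rs></p>'
for surface, ids, func in read_mentions(sample):
    print(surface, ids, func)
```

A mention with func="or" would then be an annotated disagreement between e1 and e2, while the default "and" would mean the mention refers to both.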

pagelj commented 6 years ago

Aggregated mentions, i.e. mentions that point to more than one entity, are simply treated as two separate entities with the same span, e.g. they,(CATO)|(ARSENE),(cato)|(arsene) in the Mentioned.figure_surface and Mentioned.figure_id columns.
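
One possible reading of that example row, sketched under the assumption that each entity is wrapped in parentheses and that entities sharing a span are joined with | inside the same cell. The function aggregated_row is hypothetical and only reproduces the string shown above; the actual export code may build it differently.

```python
def aggregated_row(mention, entities):
    """Format one row for a mention that points to several entities.

    entities is a list of (figure_surface, figure_id) pairs.
    Assumed layout: mention, then surfaces, then ids, comma-separated.
    """
    surfaces = "|".join(f"({surface})" for surface, _ in entities)
    ids = "|".join(f"({figure_id})" for _, figure_id in entities)
    return ",".join([mention, surfaces, ids])


row = aggregated_row("they", [("CATO", "cato"), ("ARSENE", "arsene")])
print(row)  # they,(CATO)|(ARSENE),(cato)|(arsene)
```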