Udapi now supports the bracketing "GUM-style" format of coreference annotations by default.
The CorefUD 0.1 (and 0.2) style format is supported using a specialized reader (read.OldCorefUD) and writer (write.OldCorefUD).
The API has been changed only slightly (more changes and renamings are planned):
CorefMention.__init__ has words as the second parameter (after self) because mention.words cannot be empty anymore.
mention.other (DualDict) instead of mention.misc (str)
ordering of mentions was re-defined (CorefMention.__lt__) so that longer mentions go first (if starting on the same node)
BridgingLinks serialization follows the new format (and link.relation and link.target are now mutable).
document.meta['global.Entity'] reflects the global.Entity header and can be used for reading and writing (so that the writer can use different positional attributes of entity annotations).
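
The re-defined ordering of mentions can be illustrated with a minimal, self-contained sketch. It does not use Udapi itself: ToyMention is a hypothetical stand-in, word positions are plain integers, and the sort key is only an assumption about how "longer mentions go first (if starting on the same node)" might be expressed. The real CorefMention.__lt__ may handle further tie-breaking cases.

```python
from functools import total_ordering


@total_ordering
class ToyMention:
    """Hypothetical stand-in for a mention; NOT Udapi's CorefMention."""

    def __init__(self, words):
        # Mirrors the rule that a mention must contain at least one word.
        if not words:
            raise ValueError("a mention needs at least one word")
        # Word positions within the document (plain integers in this toy).
        self.words = sorted(words)

    def _key(self):
        # Earlier start first; if two mentions start on the same word,
        # the longer one sorts first (hence the negated length).
        return (self.words[0], -len(self.words))

    def __eq__(self, other):
        return self._key() == other._key()

    def __lt__(self, other):
        return self._key() < other._key()


mentions = [ToyMention([3, 4]), ToyMention([3, 4, 5, 6]), ToyMention([1])]
ordered = [m.words for m in sorted(mentions)]
print(ordered)  # [[1], [3, 4, 5, 6], [3, 4]]
```

With this ordering, an enclosing mention is visited before the shorter mentions nested at its start, which is convenient when opening brackets have to be written outermost first.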