dkpro / dkpro-core

Collection of software components for natural language processing (NLP) based on the Apache UIMA framework.
https://dkpro.github.io/dkpro-core

Support xml id on certain TEI elements #1570

Closed reckart closed 8 months ago

reckart commented 10 months ago

Is your feature request related to a problem? Please describe.
When dealing with TEI data, the xml:id attribute is often used to keep track of correlated annotations. However, the TeiReader and TeiWriter discard that attribute except on the document itself.

Describe the solution you'd like
Some DKPro Core types like token, sentence and div have a feature to store an external ID. It would be nice if the TEI elements that are mapped to these types retained their xml:id in this field.
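To illustrate the request, here is a minimal sketch of how xml:id values could be picked up while streaming through a TEI document. It uses only the JDK's StAX API and is not the TeiReader implementation. The class name `TeiXmlIdSketch`, the method `collectIds`, and the choice of `div`, `s` and `w` as the tracked elements are assumptions made for this example. A real reader would store each value in the ID feature of the corresponding annotation instead of collecting strings.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import java.util.Set;
import javax.xml.XMLConstants;
import javax.xml.stream.XMLInputFactory;
import javax.xml.stream.XMLStreamConstants;
import javax.xml.stream.XMLStreamException;
import javax.xml.stream.XMLStreamReader;

// Hypothetical helper for illustration only; not part of DKPro Core.
final class TeiXmlIdSketch {
    // Assumption: these TEI elements correspond to the div, sentence and token types.
    private static final Set<String> TRACKED = Set.of("div", "s", "w");

    // Returns "element:id" entries in document order for tracked elements
    // that carry an xml:id attribute. Elements without xml:id are skipped.
    static List<String> collectIds(String tei) {
        List<String> result = new ArrayList<>();
        try {
            XMLStreamReader reader = XMLInputFactory.newInstance()
                    .createXMLStreamReader(new StringReader(tei));
            try {
                while (reader.hasNext()) {
                    if (reader.next() == XMLStreamConstants.START_ELEMENT
                            && TRACKED.contains(reader.getLocalName())) {
                        // xml:id lives in the reserved XML namespace.
                        String id = reader.getAttributeValue(
                                XMLConstants.XML_NS_URI, "id");
                        if (id != null) {
                            result.add(reader.getLocalName() + ":" + id);
                        }
                    }
                }
            } finally {
                reader.close();
            }
        } catch (XMLStreamException e) {
            throw new IllegalArgumentException("Not well-formed XML", e);
        }
        return result;
    }
}
```

Calling `TeiXmlIdSketch.collectIds(...)` on a TEI fragment returns one entry per tracked element that has an xml:id. On the writer side, the same mapping would run in reverse: the stored ID would be emitted as xml:id on the generated element.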
