inception-project / inception

INCEpTION provides a semantic annotation platform offering intelligent annotation assistance and knowledge management.
https://inception-project.github.io
Apache License 2.0

Invalid UIMA CAS XMI XML 1.1 #4153

Closed vishalmodagekar closed 1 year ago

vishalmodagekar commented 1 year ago

Describe the bug

Exporting an annotated PDF as UIMA CAS XMI (XML 1.0) fails with the exception "Export failed: SAXParseException: Trying to serialize non-XML 1.0 character: 0x1 at offset ..". The same document can be exported as UIMA CAS XMI (XML 1.1), but the resulting XML contains invalid xmlChars. I am not able to parse the XML file with the 'load_cas_from_xmi' function of the Python package 'dkpro-cassis'; it throws the exception 'XMLSyntaxError: xmlParseCharRef: invalid xmlChar value ..'. The exported XMI file and the type system XML are attached for reference: file-2_10.1177_0333102421989620.zip
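For anyone hitting the same parser error, here is a minimal sketch of how the offending character references could be located in an exported XMI file. It uses only the Python standard library and the character ranges allowed by the XML 1.0 specification. The helper names (`is_xml10_char`, `find_invalid_char_refs`, `replace_invalid_char_refs`) are hypothetical and are not part of INCEpTION or dkpro-cassis. Replacing each invalid reference with a single space is an assumption made here so that the text length stays unchanged; it is a workaround, not the project's fix.

```python
import re

# Numeric character references such as &#1; (decimal) or &#x1F; (hexadecimal).
_CHAR_REF = re.compile(r"&#(?:x([0-9A-Fa-f]+)|([0-9]+));")


def is_xml10_char(cp: int) -> bool:
    """Return True if the code point is allowed by the XML 1.0 Char production."""
    return (
        cp in (0x9, 0xA, 0xD)
        or 0x20 <= cp <= 0xD7FF
        or 0xE000 <= cp <= 0xFFFD
        or 0x10000 <= cp <= 0x10FFFF
    )


def _codepoint(match: re.Match) -> int:
    """Decode the code point of a matched character reference."""
    hex_digits, dec_digits = match.group(1), match.group(2)
    return int(hex_digits, 16) if hex_digits else int(dec_digits)


def find_invalid_char_refs(xml_text: str) -> list[tuple[int, str]]:
    """List (offset, reference) pairs for references that XML 1.0 forbids."""
    return [
        (m.start(), m.group(0))
        for m in _CHAR_REF.finditer(xml_text)
        if not is_xml10_char(_codepoint(m))
    ]


def replace_invalid_char_refs(xml_text: str, replacement: str = " ") -> str:
    """Replace forbidden references; valid references are left untouched."""
    def _sub(m: re.Match) -> str:
        return m.group(0) if is_xml10_char(_codepoint(m)) else replacement

    return _CHAR_REF.sub(_sub, xml_text)
```

The sketch only rewrites character references and leaves the XML declaration untouched, so whether the cleaned text then loads with 'load_cas_from_xmi' would still need to be checked against the actual export.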

reckart commented 1 year ago

Try using v29.0.

reckart commented 1 year ago

I assume this was a duplicate of https://github.com/inception-project/inception/issues/4058 and that v29 works.