dkpro / dkpro-cassis

UIMA CAS processing library written in Python
https://pypi.org/project/dkpro-cassis/
Apache License 2.0
85 stars 22 forks source link

XMI is serialized using ASCII #165

Closed reckart closed 3 years ago

reckart commented 3 years ago

Describe the bug It seems cassis by default serializes as ASCII but it should use UTF-8 instead.

To Reproduce

cas = Cas(TypeSystem())
print(cas.to_xmi())

Expected behavior XMI should be serialized as UTF-8

Please complete the following information:

reckart commented 3 years ago

@jcklie didn't we have this issue with encoding recently already?

jcklie commented 3 years ago

Did you try it with 0.5.2 instead of 0.5.0?

reckart commented 3 years ago

GitHub says that 0.5.0 is the latest release?

reckart commented 3 years ago

I did a pip install --upgrade dkpro-cassis and it says that 0.5.2 is installed already, run the code and get the ASCII.

reckart commented 3 years ago

Fixed in 0.5.3.