Closed Daedo closed 4 years ago
Do you have a minimal CAS that triggers this error?
This might not be the smallest cas, but is is the smallest where I saw this error occur:
Case 1: Id reassignment breaks nested feature structures The cas contains what is basically a linked list:
Thank you for reporting and using cassis! I fixed it hopefully in master by keeping IDs when loading XMI, I cannot recall why I regenerated them. You may want to try it out. You can just use the master via pip using python -m pip install git+https://github.com/dkpro/dkpro-cassis
. I will make a release soonish.
How do you ensure that new FSes get do not get IDs that have already been used?
I save the maximum id and generate from there on.
Is there some kind of controlled failure / error message when the IDs overflow?
How do they overflow? In python, numbers cannot overflow but convert to a bigint equivalent.
Ok. It could be that UIMA (Java) uses int for XMI IDs. Having had a cursory look at the UIMA code, I expect that XMI IDs > MAX_INT and possibly such smaller 0 might cause the UIMA Java deserialization to fail.
No further input, closing.
Describe the bug During the Serialization of a cas to xmi annotations are assigned new ids. This creates two issues:
To Reproduce Steps to reproduce the behavior:
For 1:
For 2:
Expected behavior The annotation id (xmiid) should not change during serialization..
Please complete the following information:
Additional context I'm trying to write a recommender for inception and noticed that the cas returned by the recommender is malformed.