-
Many modules contain non-Java files in the `src/main/java` hierarchy, and miscellaneous XML and settings files in the root directory or in other odd places. These should be moved to more appropriate …
-
**Describe the bug**
Annotated PDF giving exception when exported as UIMA CAS XMI (XML 1.0) "Export failed: SAXParseException: Trying to serialize non-XML 1.0 character: 0x1 at offset .." but can be…
-
Hi,
We have a tagging dilema, and we could use your advice.
We tag a transcription text according to audio files (of about an hour each). Up until now, we uploaded files into Inception, tagged the …
-
Originally reported on Google Code with ID 26
```
The files page.bin, revision.bin and text.bin, which are generated during the dump creaton
process, should be automatically deleted when processing h…
-
The transition from the "javax" namespace to "Jakarta" in the Java ecosystem is a significant development with far-reaching implications.
This issue aims to move JWPL to "jakarta.*".
JWPL mainly…
-
**Describe the bug**
External recommender fails when CAS contains control characters.
**To Reproduce**
Steps to reproduce the behavior:
1. Create a document with content 第四卷第一四二八页。 �
2. Config…
-
This is a follow up to #748 for new features that are implemented after 1.9.0.
Upgrading CoreNLP in DKPro Core at times was a bit painful in the past as we duplicated quite a bit of the functionali…
-
**Is your feature request related to a problem? Please describe.**
I encountered a situation where I had uploaded a document with character 0xc (form feed ascii control character), attempted to expor…
-
### Describe the bug
Hi there,
as described [#3605](https://github.com/inception-project/inception/issues/3605), we are trying to export the INCEpTION TypeSystem such that it allows UIMA subtypes,…
-
**Describe the bug**
When we process annotated documents exported from Inception as JSON files, we use the begin-end offsets of each span to identify words and phrases. In some annotated documents, o…