-
**What should be done in the scope of this task?**
We should provide compatibility with Apache Tika 2+.
-
## CVE-2018-8017 - Medium Severity Vulnerability
Vulnerable Library - tika-parsers-1.14.jar
Apache Tika is a toolkit for detecting and extracting metadata and
structured text content from variou…
-
**Name:**
Khudoley A. K., 2003: A GIS dataset of geological features for the Tika Creek map area (95C/10), Yukon Territory and Northwest Territories; Geological Survey of Canada, Open File 1777.
…
-
The current Tika pipeline keeps the line breaks added by email servers to fit the `78/998` max line length RFC limit.
Ideally, emails inside DS should display without these artificial line breaks.
…
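One possible way to remove such breaks after extraction can be sketched as follows. This is a heuristic under stated assumptions, not what the Tika pipeline or DS actually does: it assumes the artificial breaks are single newlines inside a paragraph and that real paragraph boundaries are marked by blank lines. The function name `unwrap_hard_wrapped` is hypothetical.

```python
import re


def unwrap_hard_wrapped(text: str) -> str:
    """Join single line breaks inside paragraphs; keep blank lines as paragraph breaks.

    Assumption: a blank line separates paragraphs, and every other
    newline is an artificial wrap that can be replaced by a space.
    """
    # Normalize CRLF and bare CR line endings to LF.
    text = text.replace("\r\n", "\n").replace("\r", "\n")
    # Split on blank lines (possibly containing whitespace).
    paragraphs = re.split(r"\n\s*\n", text)
    unwrapped = []
    for paragraph in paragraphs:
        # Drop empty fragments and join the remaining lines with a space.
        lines = [line.strip() for line in paragraph.split("\n") if line.strip()]
        unwrapped.append(" ".join(lines))
    return "\n\n".join(unwrapped)
```

This heuristic would also flatten intentional single-line structures such as signatures or plain-text lists, so a real fix would likely need to look at the message headers (for example the content transfer encoding or `format=flowed`) before deciding which breaks are artificial.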
-
**Describe the bug**
I am doing a fresh install of paperless-ng using Docker on a Raspberry Pi 3B+, but even after multiple reinstalls from docker-compose as well as Portainer I can't upload office file and …
-
## Description
I want to use the RAG feature (on pdf that I transformed with tika). I have a collection with two fields, `content` and `embedding`. The embedding is calculated with openai/text-embe…
-
**Describe the bug**
Getting a ClassNotFoundException when executing tests with the 3.2.11 release version.
The same tests succeeded with v3.2.5.
**To Reproduce**
Executed the internal tests with…
-
Latest is 1.20 (https://tika.apache.org/)
-
I have just uploaded a .msg (application/vnd.ms-outlook) file which contains some valuable e-mail text to Docspell.
Unfortunately, Docspell couldn't extract the contents.
Fri, July 8th, 2022, 11:…
-
### Chart Name
paperless-ngx
### Is your feature request related to a problem? Please describe.
To import mails into paperless-ngx we need Tika and Gotenberg. Unfortunately, they are not incl…