-
Hi there, I have scoured the web for weeks trying to solve this. Apologies if it's something simple I'm missing or if I'm barking up the wrong tree, but I simply cannot get the package installed on a new App…
-
Currently, the core reading of PDF documents is done with PyMuPDF. This needs to be benchmarked against alternatives to ensure we use the optimal backend here.
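
One way such a benchmark could be set up is sketched below. It is only an illustration, not the project's actual harness: the function name `benchmark_backends`, the `backends` mapping, and the `repeats` parameter are all assumptions made for this example. Each backend is assumed to be a callable that takes a file path and returns the extracted text.

```python
import time
from statistics import median
from typing import Callable, Dict, List


def benchmark_backends(
    backends: Dict[str, Callable[[str], str]],
    pdf_paths: List[str],
    repeats: int = 3,
) -> Dict[str, float]:
    """Return, per backend, the median wall-clock seconds to read all files.

    `backends` maps a label to a hypothetical callable (path -> text);
    nothing here depends on a specific PDF library.
    """
    results: Dict[str, float] = {}
    for name, extract_text in backends.items():
        timings = []
        for _ in range(repeats):
            start = time.perf_counter()
            for path in pdf_paths:
                extract_text(path)  # output is discarded; only time is measured
            timings.append(time.perf_counter() - start)
        # The median is less sensitive to a single slow run than the mean.
        results[name] = median(timings)
    return results
```

In practice, each callable would wrap one candidate library (for PyMuPDF, for example, a small function that opens the file with `fitz.open` and joins `page.get_text()` over the pages). Speed is only one axis; extraction quality would need a separate comparison.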
-
### Contact Details [Optional]
### System Information
Unable to find ZenML repository in your current working directory (/home/user/foo/bar) or any parent directories. If you want to use an exis…
-
I am building IPED on Linux Mint 21 (Liberica JDK 11.0.19 full, Maven 3.9.2). The following error occurred while running the tests:
[INFO] Tests run: 10, Failures: 0, Errors: 0, Skipped:…
-
I'm using Apache Tika to OCR a bunch of PDFs. When I use the GUI (by running `java -jar tika-app-1.22.jar`) everything works fine: I go to "Recursive JSON" on the "View" menu and the text is all there (ev…
-
Hi @chrismattmann,
Is there an existing Dockerfile for deploying tika-python to a Docker environment? I found a tika-python Dockerfile on Docker Hub:
https://hub.docker.com/layers/t…
-
### A few possible ways to go about this.
- Try some other Node.js PDF parsing packages to see whether any are much more performant than PDF.js (pdf2json, pdfreader)
- Build a microservice i…
-
### Description
I'm having problems converting Office documents to PDF on a fresh installation (via the installation script).
Whenever I add an Office document (doesn't matter if I add it via the…
-
**Error message**
ValueError: Could not load model google/flan-t5-large with any of the following classes: (, ).
es / preprocessing steps / settings of reader etc.
**To Reproduce**
prompt_node =…
-
Should [TIKA-1982](https://issues.apache.org/jira/browse/TIKA-1982) be implemented such that the detected language is available from the `/rmeta` endpoint, tika-python will probably need updating so th…