-
tjbck updated
1 month ago
-
First of all, thank you so much for this most valuable application.
I am unable to index the content on pdf files in Ubuntu 18.04.
This is the error while executing the index command `opensemant…
-
How do I supply a password for a password protected PDF?
I am getting this error:
```
*** Reading ./data/attachments/dasd.pdf
INFO - Document is encrypted
[Fatal Error] :1:1: Content is not allowed …
-
I am running paperless-ng `1.4.5` in a container. The consume directory is mounted via NFS so inotify does not work. I have set `PAPERLESS_CONSUMER_POLLING=60` with the expectation that the directory …
-
The hope here is to get TikaOnDotNet fully configured to access Tesseract OCR for text extraction from images. With Tika .93 support for Tesseract was added, and we are now in the midst of validating…
-
# Issue Description #
Good evening, @dadoonet. My name is Procopie Gabi and I decided to use a verification tool on your application for a project given by my university. I know that the bugs I f…
-
Write up mini-paper comparing performance of various text-extractors on a document with available plaintext (possibly a particular edition of the bible).
- [ ] Find popular samples with clean and accu…
-
Hi,
I'm getting this error :
`.../pdf2html@3.1.0/node_modules/pdf2html postinstall: throw new Error(`Failed downloading dependency ${filename}.`);
.../pdf2html@3.1.0/node_modules/p…
-
Set up paperless-ngx and mount it into Nextcloud.
-
I am trying to extract text from scanned pdf documents. It works fine for most of them except a couple I tested.
I am able to extract the metadata correctly but not the text in the pdf. It returns wi…