-
### Describe the bug
How did you download and install the software? `MacPorts` (BTW not offered in the drop-down menu below...)
Run `ocrmypdf bid\$pdf bid_.pdf`
=> "crash" on this particular file `…
-
Hi
Thanks for developing Sumatra PDF reader. I was very excited to finally get PDF annotations released in version 3.3. Thank you for the hard work!
One feature that I'm missing though is the ab…
-
### Checked other resources
- [X] I added a very descriptive title to this issue.
- [X] I searched the LangChain documentation with the integrated search.
- [X] I used the GitHub search to find a sim…
-
There will need to be two products from the PDF build: one will be a download-ready PDF that has the covers just inserted at the beginning and the end, and the other will be a cover-free PDF that's de…
-
Hello, this looks like a really useful tool. However, it depends on pdftk, which is no longer maintained. For example, Fedora retired this package six years ago: https://src.fedoraproject.org/rpms/pdf…
-
It would be nice to be able to set something like `--auto-size` when calling wkhtmltopdf to generate a PDF with one page of a minimum possible size. This can be currently done [using a workaround](htt…
-
First of all, I love this, thanks for creating s3-ocr!
Everything works as it should, following the instructions in your TIL https://simonwillison.net/2022/Jun/30/s3-ocr/
...except that the very…
-
### Description
Hello,
I installed paperless on bare metal, I got no errors or anything else, but when I try to upload a file wit hthe webui, it looks like described here: [https://github.com/pa…
-
### Description
After uploading PDF files some files will not be processed, the following error occurs:
TypeError: Only scalar types, arrays, and dictionaries are allowed in content streams.
Th…
-
Hi,
i have to ocr mixed content PDF, example : 100 pages with vector text and shapes, then 100 pages with only image (from scan). If i force OCR i loose quality from layer so i decide to script li…