-
In my company we just need full text search on PDFs that were already scanned and converted into Text-PDFs - so no OCR needed.
And OCR was disabled in /etc/opensemantic/etl and the ETL service was re…
-
the imported JPGs seem like the original images with margins/background, while the scanned pdfs are cropped to the actual paper, which are not imported.
-
Besides text changes in text PDFs, this would be even more helpful if it could produce an image with a visual indication of:
- Changes to non-text PDFs. Much needed for textual changes to scanned, tex…
-
I now have a couple of examples of scanned PDFs that I have taken through ABBY 11 for OCR. When I try to open the resulting pdfs in GG Imagine I get the message: "Unable to find images for all pages".…
-
I would like to move my work extracting images from pdfs from calling pdfimages to your package. Would you be interested in including this command to an R function?
I feel this would open up more…
-
```
* Xreader version 3.6.3
* Distribution - Mint 21.1
```
**Issue**
Printing some PDF files (not all) results in loss of data on the printed pages. I've experienced this issue with a few PD…
-
Is there a list of file types that can be scanned? For instance not just traditional image files, but also file types like PDFs, docs, txt, etc.
-
I've run into an issue with modifying certain pdf files. The issue seems happen only with pdf files that were generated by a scanner. I'm successfully adding serial numbers to the bottoms of every pag…
-
Stemming from on [an email discussion](https://groups.google.com/g/islandora-dev/c/oPr1ZsJx-HA):
Hypercube currently uses pdftotext to extract text embedded in a PDF OR tesseract to perform OCR on …
-
https://github.com/phuoc-ng/react-pdf-viewer/issues/278
https://github.com/phuoc-ng/react-pdf-viewer/issues/276
https://github.com/phuoc-ng/react-pdf-viewer/issues/263
I get all 3 scenarios here …