-
### In what scenarios do you need this feature?
After configuring siyuan's OCR, I felt that the recognition rate was low. Later, I switched to software that invokes the PaddleOCR API and found th…
-
**Is your feature request related to a problem? Please describe.**
Instead of having to rely on a cloud service, e.g. using Azure AI Document Intelligence in the current state, it would be very neat …
-
Hello guys. Thank you so much for this brilliant model.
I'm aware that Donut is an OCR-free model which does not rely on an OCR input. When I performed some tests (fine-tuning the model), I realized…
-
Environment:
![image](https://github.com/peakhell/OCRIntegrator/assets/86536994/91f3b191-1c2c-4051-84f4-1abbf9d40f34)
```
(ocr) ➜ OCRIntegrator git:(main) ✗ pip list | grep tensor
nvidia-tensorrt …
-
1. this image is recognized by ocrad:
![06_mcr](https://cloud.githubusercontent.com/assets/1483884/15124696/48b5e068-15f7-11e6-86d6-f6129cb011b0.png)
2a. that one is not (recognized):
![11_mcr-ext…
-
ocrmypdf works great on PDFs with scanned images. However, with handwritten letters, the Tesseract OCR engine often struggles.
How do I use the Azure OCR API as the OCR engine while keeping everyt…
-
Thanks for your great work! But it still has some problems. I have a PDF that is not scanned (you can select the words in the file). When using your method, it recognizes 'benefit' as 'benets'. …
-
I have an issue when I add the Thai language to Scribe OCR, as follows:
1. I just added tha.traineddata.gz to \tess\lang, but the console log shows "Error: Tesseract (legacy) engine requested, but comp…
-
I looked through the code, and the PDF loader currently used is PyMuPDF. Among the free libraries, PDFMiner works better than PyMuPDF and PyPDF, so it would be good to have it. Additionally, documents th…
-
The OCR engine does increase the attack surface of Dangerzone; this has been a longstanding hypothesis of mine. We just don't know by how much. And recently in the Dangerzone security audit, the auditor…