Different OCR results for images - Githubissues

TheJoeFin / Text-Grab

Use OCR in Windows quickly and easily with Text Grab. With optional background process and notifications.

https://www.microsoft.com/en-us/p/text-grab/9mznkqj7sl0b?cid=TextGrabGitHub

MIT License

3.18k stars, 218 forks

Different OCR results for images #416

Open vivadavid opened 8 months ago

vivadavid commented 8 months ago

Describe the bug

Using the same images, I get different OCR results depending on whether I use the Extract Text from Images in Folder tool or simply drag and drop the images onto the Edit Text Window. This only happens when using Tesseract for Spanish; if I use the Microsoft OCR engine for Spanish, I get the same recognized text with either approach.

These are my results with the Extract Text from Images in Folder tool (see attached ZIP file):

On image1.jpg, no text is recognized (but it's recognized when I drag and drop the image).
On image2.jpg, the text is recognized (but it's not exactly the same text as the text recognized when I drag and drop the image).
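
To pin down exactly where the two outputs differ, the text from each approach could be saved and compared line by line. The following is only a minimal sketch using Python's standard `difflib` module; the function name `compare_ocr_outputs` and the sample strings are hypothetical placeholders, not the actual OCR output from the attached images, and nothing here is part of Text Grab itself.

```python
import difflib


def compare_ocr_outputs(folder_text: str, drop_text: str) -> tuple[float, list[str]]:
    """Return a similarity ratio (0.0 to 1.0) and a unified diff of two OCR results."""
    # Character-level similarity between the two recognized texts.
    ratio = difflib.SequenceMatcher(None, folder_text, drop_text).ratio()
    # Line-level diff showing which lines were added, removed, or changed.
    diff = list(
        difflib.unified_diff(
            folder_text.splitlines(),
            drop_text.splitlines(),
            fromfile="folder_tool",
            tofile="drag_and_drop",
            lineterm="",
        )
    )
    return ratio, diff


if __name__ == "__main__":
    # Placeholder strings for illustration only (not real results from the ZIP file).
    folder_result = "Hola mundo\nEsto es una prueba"
    drop_result = "Hola mundo\nEsto es una prueba."
    ratio, diff = compare_ocr_outputs(folder_result, drop_result)
    print(f"similarity: {ratio:.3f}")
    print("\n".join(diff))
```

An empty folder-tool result (as with image1.jpg) would show up as a similarity of 0.0 with every drag-and-drop line marked as added.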

Where is the bug

OCR Output.

Where did you get Text Grab?

Exe

Desktop (please complete the following information):

OS: Windows 11.
Version 23H2.
Text Grab 4.3.1.