TheJoeFin / Text-Grab

Use OCR in Windows quickly and easily with Text Grab. With optional background process and notifications.
https://www.microsoft.com/en-us/p/text-grab/9mznkqj7sl0b?cid=TextGrabGitHub
MIT License
3.18k stars, 218 forks

Different OCR results for images #416

Open. vivadavid opened this issue 8 months ago

vivadavid commented 8 months ago

Describe the bug

Using the same images, I get different OCR results depending on whether I use the Extract Text from Images in Folder tool or drag and drop the images onto the Edit Text Window. This only happens with the Spanish Tesseract engine. With the Microsoft OCR engine for Spanish, I get the same recognized text with either approach.

These are my results with the Extract Text from Images in Folder tool (see attached ZIP file):

  1. On image1.jpg, no text is recognized, although text is recognized when I drag and drop the same image.
  2. On image2.jpg, text is recognized, but it is not exactly the same as the text recognized when I drag and drop the image.

Where is the bug

Where did you get Text Grab?

Desktop (please complete the following information):

files.zip