TheJoeFin / Text-Grab

Use OCR in Windows quickly and easily with Text Grab. With optional background process and notifications.
https://www.microsoft.com/en-us/p/text-grab/9mznkqj7sl0b?cid=TextGrabGitHub
MIT License
3.15k stars 216 forks source link

Bad text recognition #479

Closed Zio3D closed 1 month ago

Zio3D commented 2 months ago

Describe the bug Does not correctly understand the text of the Ubuntu server console. Image for the OCR

Where is the bug

To Reproduce Steps to reproduce the behavior: Grab Frame on the image attached

Expected behavior That all characters are recognized correctly as there is no video interference that distorts the text or creates ambiguity about the characters.

Screenshots

Where did you get Text Grab?

Desktop (please complete the following information):

Additional context

If I OCR the attached image (I opened it with Microsoft Paint of Windows 10 Pro) with zoom at 100%, I get the following text:

[ CIK ] Reached target System Power Off.
[ 1745.141662] sgstemd-shutdown [1] : Haiting for process: java

If I OCR the attached image (I opened it with Microsoft Paint of Windows 10 Pro) with zoom at 200%, I get the following text:

[ OK ] Reached target System Power off.
[ 1745.141662] sgstemd-shutdau-lr-l [1] : Hai ting far pracess: java

The correct text is:

[ OK ] Reached target System Power Off.
[ 1745.141662] systemd-shutdown [1]:  Waiting for process: java

With the comand: Get-WindowsCapability -Online | Where-Object { $_.Name -Like 'Language.OCR*' }, I have these languages ​​installed:

Name  : Language.OCR~~~en-US~0.0.1.0
State : Installed

Name  : Language.OCR~~~it-IT~0.0.1.0
State : Installed

Ubuntu console font parameters: FONTFACE="Fixed" FONTSIZE="8x16"

TheJoeFin commented 1 month ago

This is a bug with the underlying OCR models being used. I am eager to use updated models when they become available to 3rd party developers, but currently the best way to improve would be to use the Text Grab settings to install Tesseract and use those models.

Zio3D commented 1 month ago

OK, thanks, I'll follow your advice 😃