Strange work of the standard OCR tool

ogkalu2 / comic-translate

Desktop app for automatically translating comics - BDs, Manga, Manhwa, Fumetti and more in a variety of formats (Image, Pdf, Epub, cbr, cbz, etc) and in multiple languages.

Apache License 2.0

1.15k stars 97 forks source link

Strange work of the standard OCR tool #153

Open TaunT opened 1 month ago

TaunT commented 1 month ago

This is probably a life hack =)

I usually translate comics from English, and there were always a lot of mistakes when digitizing. Yesterday I translated a page from French and forgot to switch back, today I started digitizing from English and everything went perfectly, not a single mistake.

I checked - the result is constant. If you select French when digitizing English text, the result is much better!)

ogkalu2 commented 1 month ago

Are you on default when you do this ?

TaunT commented 1 month ago

Was in my branch. But now I switched to the main one and checked again. If the language in the settings is English - it recognizes worse If French - perfect)

ogkalu2 commented 1 month ago

No what I mean is, what OCR option do you have in settings when this happens? Default, Microsoft or Google ?

TaunT commented 1 month ago

Ah, it all turned out to be very simple) For English, the simplest model is used, and for French, GPT-4o

from readme.md

OCR

By Default:

EasyOCR for English
GPT-4o for French, Russian, German, Dutch, Spanish and Italian. Paid, Requires an API Key.

TaunT commented 1 month ago

@ogkalu2

No what I mean is, what OCR option do you have in settings when this happens? Default, Microsoft or Google ? I didn't understand at first) Yes, it is Default in the settings

ogkalu2 commented 1 month ago

Yes that's it. Pororo also supports English. I think it's better also. Maybe I should switch to that for Default ?

Can you set your source language to Korean and see if the English results are better than EasyOCR ?

TaunT commented 1 month ago

in "Korean" mode some words were not recognized and replaced with hieroglyphs.

It is necessary to try to change the default model in the code.

By the way, with GPT there is also fix a problem of intersecting blocks

TaunT commented 1 month ago

@ogkalu2 Would it be nice to add the choice of GPT to the OCR settings?

ogkalu2 commented 1 week ago

in "Korean" mode some words were not recognized and replaced with hieroglyphs.

It is necessary to try to change the default model in the code.

By the way, with GPT there is also fix a problem of intersecting blocks

In modules/ocr, line 203, change self.pororo_cache = PororoOcr() to self.pororo_cache = PororoOcr(lang='en') and see if it's better(still keep the source lang a Korean in the UI)

What do you mean by the problem of intersecting blocks ?

ogkalu2 commented 1 week ago

@ogkalu2 Would it be nice to add the choice of GPT to the OCR settings?

Might do this yeah.