SubhamTyagi / android-ocr

Tesseract based OCR for android
Apache License 2.0
365 stars 61 forks source link

Add preserve_interword_spaces=1 argument #109

Open languagemaniac opened 10 months ago

languagemaniac commented 10 months ago

Hi, when I OCR Japanese text, it comes out with random spaces.

This can be easily fixed by adding "-c preserve_interword_spaces=1" as an argument when executing tesseract.

I tried it on my PC with the same result. Adding that argument fixes the issue.

For what I've been reading, it's the same for Chinese and Korean, (though I haven't tried with those) so maybe there should be an option to enable / disable that specific argument, as these languages don't have any interword spaces whatsoever.

T8RIN commented 4 months ago
languagemaniac commented 4 months ago
  • I will try to add this in my app

Which app?

T8RIN commented 4 months ago

ImageToolbox