English text looks strange despite having Tesseract

You will additionally need to enable tesseract in your current profile, in the preprocessor settings. Don't forget to hit apply after making changes in the profile.

Tesseract is optional and isn't so good with ALL CAPS so it's off by default. There is the problem with relying on the text detector to figure out what language a bubble is. It can only detect japanese and english as languages, but can still recognize latin text, so it calls spanish english, usually. I will add a language override in the next release so you can tell it what language to use, ignoring the detected language. That way spanish and maybe chinese should become supported by tesseract. (I'll work on that in September)

Civvic is also experimenting with visual LLMs that are remarkably good at OCR (both local and api-based) which will open the door to much better OCR in the future.

Until then, you can also manually correct OCR with the review mode, which is on by default. That's new in the latest version.

Good luck, glad to hear it's been helpful.

VoxelCubes / PanelCleaner

English text looks strange despite having Tesseract #109