mathewthe2 / Game2Text

Complete toolbox for gamifying language learning
https://www.Game2Text.com
Apache License 2.0
182 stars, 23 forks

Add custom OCR engines #40

Open · GrimPixel opened 2 years ago

GrimPixel commented 2 years ago

The supported Tesseract version is currently 4.1.1, which is already a legacy release. It would be best to let users download OCR engines themselves and use them.

mathewthe2 commented 2 years ago

I can see use cases for Tesseract 5, manga_ocr, and PaddleOCR.

It's pretty straightforward to include them in the build, but doing so would introduce a lot of dependencies and greatly increase the size of the app.

Optional downloads, however, would require building separate binaries so that users can easily download and use them. I will follow up if I manage to package them or if someone can supply them.
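To make the optional-engine idea concrete, here is a rough sketch of how an app could detect which engines a user has installed without bundling any of them. This is not Game2Text's actual code: `OcrEngine`, `EngineRegistry`, `module_installed`, and `binary_on_path` are hypothetical names, and the module and executable names (`tesseract`, `manga_ocr`, `paddleocr`) are assumptions about how those engines are usually installed.

```python
import importlib.util
import shutil
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass(frozen=True)
class OcrEngine:
    """Hypothetical descriptor for an optional OCR engine (not Game2Text's API)."""
    name: str
    is_available: Callable[[], bool]  # cheap check that avoids importing heavy deps


def module_installed(module_name: str) -> bool:
    """Return True if a top-level Python module can be located without importing it."""
    return importlib.util.find_spec(module_name) is not None


def binary_on_path(executable: str) -> bool:
    """Return True if an executable with this name is found on PATH."""
    return shutil.which(executable) is not None


class EngineRegistry:
    """Keeps track of known engines and reports which ones are usable right now."""

    def __init__(self) -> None:
        self._engines: Dict[str, OcrEngine] = {}

    def register(self, engine: OcrEngine) -> None:
        self._engines[engine.name] = engine

    def available(self) -> List[str]:
        # Sorted so the result is stable regardless of registration order.
        return sorted(n for n, e in self._engines.items() if e.is_available())


registry = EngineRegistry()
# Assumed install names; adjust to however the engines are actually packaged.
registry.register(OcrEngine("tesseract", lambda: binary_on_path("tesseract")))
registry.register(OcrEngine("manga_ocr", lambda: module_installed("manga_ocr")))
registry.register(OcrEngine("paddleocr", lambda: module_installed("paddleocr")))

if __name__ == "__main__":
    # Lists only the engines detected on this machine.
    print(registry.available())
```

With something like this, the base build stays small and an engine only shows up in the UI once the user has installed it. It does not solve the packaging problem itself, since the separate binaries would still need to be built and hosted somewhere.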