mathewthe2 / Game2Text

Complete toolbox for gamifying language learning
https://www.Game2Text.com
Apache License 2.0
186 stars 24 forks source link

Make other language support more obvious #48

Open artjomsR opened 1 year ago

artjomsR commented 1 year ago

This tool is correctly advertised as working for all languages but out of the box works only with Japanese and it's not obvious how to use it for other languages. This will make the tool more accessible to all language learners. Suggested changes:

1) Add an option to UI settings to select a language for OCR

OR

2) Add documentation to make it more obvious how the user can do the same manually. Here's my attempt: In config.ini, change values according to https://tesseract-ocr.github.io/tessdoc/Data-Files-in-different-versions.html and replace them in these lines

tesseract_language = jpn
ocr_space_language = jpn

Download XYZ.traineddata for your language from https://github.com/tesseract-ocr/tessdata_best/ (OR https://github.com/tesseract-ocr/tessdata) and put it in the game2text\resources\bin\win\tesseract\tessdata folder

drewboardman commented 4 months ago

How does one actually use it for other languages?

artjomsR commented 4 months ago

@drewboardman You should be able to follow the instructions in my comment above (after Here's my attempt: part). This worked for me with non-Japanese language at the time of writing the comment