tesseract-ocr / langdata

Source training data for Tesseract for lots of languages
Apache License 2.0
827 stars 886 forks

Missing Thaana.unicharset #95

Closed: Shreeshrii closed this issue 5 years ago

Shreeshrii commented 6 years ago

There is no script-level unicharset for the Thaana script, which is used for the Dhivehi language.

However, 4.0x traineddata files exist for both Thaana and div.

Shreeshrii commented 5 years ago

https://github.com/tesseract-ocr/langdata_lstm/tree/master/Thaana