unicode-org / unilex

Lexical data at Unicode
Other
63 stars 16 forks source link

Thai #8

Closed r12a closed 5 years ago

r12a commented 5 years ago

Am i missing something? I can't find a frequency list for Thai.

brawer commented 5 years ago

Indeed, but contributions are very welcome. To get started, write a crawler to collect a Thai corpus, for example by extending Corpus Crawler. Word segmentation is tricky for Thai but there’s libraries.

r12a commented 5 years ago

Ok, thanks for confirming. I'll close this then.