cjnevin / Dawg

Directed acyclic word graph in Swift
MIT License
6 stars 2 forks source link

Turkish characters #1

Closed melihbuyuk closed 7 years ago

melihbuyuk commented 7 years ago

Hi Chris,

In Turkish characters, it does not work. How is the problem?

This is the file : turkish_raw.txt

Thanks...

cjnevin commented 7 years ago

Thanks for your interest in this project, I actually only ran the tests on French and English languages. I will need to try this out and get back to you about why it is having problems, my guess would be it has something to do with the character length being calculated incorrectly somewhere, since UTF8 characters can contain multiple bytes.

cjnevin commented 7 years ago

@melihbuyuk Please try the new version, it supports unicode characters.