Bookworm-project / BookwormDB

Tools for text tokenization and encoding
MIT License
84 stars 12 forks source link

Unicode patch #125

Closed bmschmidt closed 7 years ago

bmschmidt commented 7 years ago

Proposed fix for Arabic and Hindi. Discussion here