WorksApplications / SudachiPy

Python version of Sudachi, a Japanese tokenizer.
Apache License 2.0
392 stars, 50 forks

Improve Speed #78

Closed: izziiyt closed this 5 years ago

izziiyt commented 5 years ago

74

Speed improved by 30% when tokenizing neko.txt (https://github.com/haradatm/nlp/blob/master/rnnlm/datasets/soseki/neko.txt).
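For anyone who wants to check a figure like this, here is a minimal timing sketch. It is not the script used for the measurement above. The helper names `time_tokenizer` and `percent_improvement` are made up for illustration, and the sketch assumes "improved" means a reduction in elapsed wall-clock time.

```python
import time
from typing import Callable, Iterable


def time_tokenizer(tokenize: Callable[[str], object],
                   lines: Iterable[str],
                   repeats: int = 3) -> float:
    """Return the best wall-clock time in seconds over `repeats` full passes.

    `tokenize` is any callable that takes one line of text (hypothetical
    helper; wrap the tokenizer you want to measure in such a callable).
    """
    lines = list(lines)  # materialize once so every pass sees the same input
    best = float("inf")
    for _ in range(repeats):
        start = time.perf_counter()
        for line in lines:
            tokenize(line)
        best = min(best, time.perf_counter() - start)
    return best


def percent_improvement(before: float, after: float) -> float:
    """Percentage reduction in elapsed time going from `before` to `after`."""
    return (before - after) / before * 100.0
```

To compare two revisions, run `time_tokenizer` on the same lines of neko.txt once per revision, passing a callable that wraps the tokenizer's `tokenize` method, then feed the two timings to `percent_improvement`. Taking the best of several passes reduces noise from other processes.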