WorksApplications / SudachiPy

Python version of Sudachi, a Japanese tokenizer.
Apache License 2.0
392 stars 50 forks source link

improve character category search #81

Closed izziiyt closed 5 years ago

izziiyt commented 5 years ago

reconstruct range_list in CharacterCategory to apply binary search in get_character_category