WorksApplications / SudachiPy

Python version of Sudachi, a Japanese tokenizer.
Apache License 2.0
388 stars 50 forks source link

Inconsistency on dictionary_form and reading_form fields while analyzing the contexts including specific symbol chars after white space #142

Open sorami opened 4 years ago

sorami commented 4 years ago

https://github.com/explosion/spaCy/issues/5961#issuecomment-679967076