WorksApplications / SudachiTra

Japanese tokenizer for Transformers
Apache License 2.0
77 stars 10 forks source link

Feature/add cleaning and preprocessing #32

Closed t-yamamura closed 2 years ago

t-yamamura commented 2 years ago

https://github.com/WorksApplications/SudachiTra/issues/23