curiosity-ai / catalyst

🚀 Catalyst is a C# Natural Language Processing library built for speed. Inspired by spaCy's design, it brings pre-trained models, out-of-the box support for training word and document embeddings, and flexible entity recognition models.
MIT License
715 stars 73 forks source link

chinese support? #32

Closed mingyangzhu closed 4 years ago

mingyangzhu commented 4 years ago

can support chinese?

leonzucchini commented 4 years ago

Catalyst does not currently support Chinese, Japanese, or Korean because tokenisation is completely different from European languages. It may be added in the future.