curiosity-ai / catalyst

🚀 Catalyst is a C# Natural Language Processing library built for speed. Inspired by spaCy's design, it brings pre-trained models, out-of-the-box support for training word and document embeddings, and flexible entity recognition models.
MIT License
715 stars 73 forks

Investigate porting newer CLD3 model #26

Closed by theolivenbaum 3 years ago

theolivenbaum commented 4 years ago

Google published a new language detection model that supersedes their earlier CLD2 model: https://github.com/google/cld3

It might be worth investigating as a lighter alternative to our current FastText model.