🚀 Catalyst is a C# Natural Language Processing library built for speed. Inspired by spaCy's design, it brings pre-trained models, out-of-the-box support for training word and document embeddings, and flexible entity recognition models.
Here is my code:
using System;
using Catalyst;
using Mosaik.Core;

// Each language must be registered first (and its NuGet package installed).
Catalyst.Models.Chinese.Register();
Storage.Current = new DiskStorage("catalyst-models");
var nlp = await Pipeline.ForAsync(Language.Chinese);
var doc = new Document("諸葛亮是三國時代著名軍師", Language.Chinese);
nlp.ProcessSingle(doc);
// Dump the processed document, including tokens and tags, as JSON.
Console.WriteLine(doc.ToJson());
Language
Chinese
Describe the bug
Catalyst does not process a Chinese sentence correctly.
To Reproduce
Run the code above.
Expected behavior
The sentence is tokenized and tagged correctly.
Additional context
Thank you for your help!