JnRMnT / ZemberekDotNet

ZemberekDotNet is the .NET Port of Zemberek-NLP (Natural Language Processing tools for Turkish).
Apache License 2.0
15 stars 4 forks source link
csharp language machine-learning morphology natural-language-processing nlp nuget turkish zemberek zemberek-nlp

ZemberekDotNet

Test Status Code Coverage

Build Status Release Status

ZemberekDotNet is the C#/.NET Port of Zemberek-NLP (Natural Language Processing tools for Turkish).

This library will be kept in sync with Zemberek-NLP and same module structure will be maintained in .NET platform using NuGet packages under seperate projects.

Modules

Module Package Name Description Status
All ZemberekDotNet.All Wrapper Package that includes all the modules. NuGetNuGet
Core ZemberekDotNet.Core Special Collections, Hash functions and helpers. NuGet NuGet
Morphology ZemberekDotNet.Morphology Turkish morphological analysis, disambiguation and word generation. NuGet NuGet
Tokenization ZemberekDotNet.Tokenization Turkish Tokenization and sentence boundary detection. NuGet NuGet
Normalization ZemberekDotNet.Normalization Basic spell checker, word suggestion. Noisy text normalization. NuGet NuGet
NER ZemberekDotNet.NER Turkish Named Entity Recognition. NuGet NuGet
Classification ZemberekDotNet.Classification Text classification based on Java port of fastText project. NuGet NuGet
Language Identification ZemberekDotNet.LangID Fast identification of text language. NuGet NuGet
Language Modeling ZemberekDotNet.LM Provides a language model compression algorithm. NuGet NuGet
Applications ZemberekDotNet.Apps Console applications Pending
gRPC Server ZemberekDotNet.GRPC gRPC server for access from other languages. Pending
Examples ZemberekDotNet.Examples Usage examples. Pending

Target Platforms

Packages are targeting .NET Standart 2.1 Framework so that it can be used within .Net Core and .Net Framework projects. Examples/console applications will also be prepared with .Net Core aiming that the whole library can be used cross platform.

CI/CD

Repository is configured to continuously trigger a build, test and release cycle using Azure DevOps. At the end of a successful release, it automatically publishes the artifacts to NuGet.org.