dotnet / machinelearning

ML.NET is an open source and cross-platform machine learning framework for .NET.
https://dot.net/ml
MIT License
8.92k stars 1.86k forks source link

Improve Microsoft.ML.Tokenizers and drive complete 1.0 API #6984

Open ericstj opened 4 months ago

ericstj commented 4 months ago

Goal: Enable .NET developers to use tokenizers in their data pre-processing pipelines as part of their embedding and token generation tasks using language models.

Committed:

Backlog: