dotnet / machinelearning

ML.NET is an open source and cross-platform machine learning framework for .NET.
https://dot.net/ml
MIT License
8.92k stars 1.86k forks source link

Track Tokenizers design feedback #6982

Closed tarekgh closed 4 months ago

tarekgh commented 5 months ago

This issue to track investigating and address the feedback we got regarding the tokenizers design

@stephentoub Feedback

If we’re able to make such breaking changes, we should also be reconsidering other aspects of the library then I think, in particular for perf, for example: