Open shaltielshmid opened 9 months ago
Tagging subscribers to this area: @dotnet/area-system-numerics-tensors See info in area-owners.md if you want to be subscribed.
Author: | shaltielshmid |
---|---|
Assignees: | - |
Labels: | `api-suggestion`, `area-System.Numerics.Tensors`, `untriaged` |
Milestone: | - |
There is a plan to add the full set of BLAS (level 1, 2, and 3) APIs; which would include basic matrix operations.
Due to the sheer scope of the work, a lot of this is incremental and will be added incrementally to ensure the best experience and API shape can be provided.
Background and motivation
Recently there's been a trend online of developing very minimal but efficient systems for LLM inference. After testing out TensorPrimitives vs the other numerical libraries out there, TensorPrimitives seems to be extremely efficient in comparison and I see an opportunity here to build a system similar to (vllm)[https://github.com/vllm-project/vllm] native to C# which can provide a solid base for LLM inferencing in C#.
API Proposal
API Usage
This can work for Vector / Matrix multiplication as well, by treating vectors as [1,N] matrices.
These are the two main operations I can think off the top of my head, but I'll continue editing as I think of more.
@luisquintanilla