Open lucasjinreal opened 1 year ago
It's using Rayon for data parallel matrix vector mult, but no other libraries.
See the rust library Candle
which has a full implementation with matrix mults.
Was thinking I would try implementing GGML-style quantization. Any other features you would want?
Yes, would like ask some more questions:
does it using any mat calculation accelerate framework from rust's lib? any plan to make it further, for instance, make it like ggml popular