RWKV / rwkv.cpp

INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
MIT License

Update ggml #128

Status: Closed (saharNooby closed this 10 months ago)

saharNooby commented 10 months ago

Notable improvements:

k-quants support was NOT added, since the main ggml repo does not support them yet.

On the internal side: