RWKV / rwkv.cpp

INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
MIT License

Fix visual bug in quantization #92

Closed (LoganDark closed 1 year ago)

LoganDark commented 1 year ago

The quantization output didn't report the compression ratio properly because of a copy/paste error :(
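For context, here is a minimal sketch of how a compression ratio is usually derived, assuming it means original size divided by quantized size. The PR text doesn't say which values were mixed up, so the names `compression_ratio`, `print_quantization_summary`, `orig_bytes` and `quantized_bytes` are hypothetical and not taken from the rwkv.cpp source.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdio>

// Hypothetical helper, not the rwkv.cpp implementation.
// Assumes "compression ratio" means original size / quantized size.
double compression_ratio(std::size_t orig_bytes, std::size_t quantized_bytes) {
    // A zero-sized output would make the ratio meaningless.
    assert(quantized_bytes != 0 && "quantized size must be non-zero");
    // Convert before dividing so the result is not truncated to an integer.
    return static_cast<double>(orig_bytes) / static_cast<double>(quantized_bytes);
}

// Hypothetical summary printer: each line reads from its own variable,
// which is the kind of place where a copy/paste slip shows up only in the
// displayed numbers, not in the quantized data itself.
void print_quantization_summary(std::size_t orig_bytes, std::size_t quantized_bytes) {
    const double mib = 1024.0 * 1024.0;
    std::printf("original size     = %8.2f MB\n", orig_bytes / mib);
    std::printf("quantized size    = %8.2f MB\n", quantized_bytes / mib);
    std::printf("compression ratio = %8.2f\n",
                compression_ratio(orig_bytes, quantized_bytes));
}
```

Calling `print_quantization_summary(orig_total, new_total)` once after all tensors are processed keeps the reporting separate from the quantization loop, so a display mistake cannot affect the written model.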