RWKV / rwkv.cpp

INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
MIT License
1.37k stars 90 forks source link

Various improvements #104

Closed saharNooby closed 1 year ago

saharNooby commented 1 year ago

Most noteworthy is cuBLAS on Windows documentation improvement. Less noteworthy is more useful contract of rwkv_gpu_offload_layers.