BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.
190
stars
21
forks
source link
[Bug] Improve the Default Config Value and fix a Bug for TensorCore Config with Small shapes #32
Closed
LeiWang1999 closed 1 month ago