yuhuixu1993 closed this 4 weeks ago
Thank you for your interest in our work!
We haven't implemented a 1-bit quantized matmul CUDA kernel. Currently we have only implemented the 4-bit and 2-bit kernels.
On the accuracy side, you can check 1-bit KIVI performance through simulation, i.e. fake quantization (using floating-point numbers to simulate the 1-bit integer values).
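To make the fake-quantization idea concrete, here is a minimal NumPy sketch. It is not the KIVI code: the function name `fake_quantize`, the asymmetric min-max scheme, and the choice of grouping axis are assumptions made for illustration only. The tensor is rounded to the 2^n_bits levels between each group's min and max and then immediately dequantized, so everything stays in floating point and no low-bit kernel is needed.

```python
import numpy as np

def fake_quantize(x: np.ndarray, n_bits: int = 1, axis: int = -1) -> np.ndarray:
    """Asymmetric min-max fake quantization along `axis` (hypothetical helper).

    The output is floating point but takes at most 2**n_bits distinct
    values per group, which simulates an n_bits integer representation.
    """
    levels = 2 ** n_bits - 1  # 1 bit -> one step, i.e. two representable values
    x_min = x.min(axis=axis, keepdims=True)
    x_max = x.max(axis=axis, keepdims=True)
    scale = (x_max - x_min) / levels
    scale = np.where(scale == 0, 1.0, scale)  # avoid division by zero for constant groups
    q = np.clip(np.round((x - x_min) / scale), 0, levels)  # integer codes, stored as floats
    return q * scale + x_min  # dequantize back to the original value range

# Toy data standing in for a slice of the KV cache (an assumption, not real activations).
rng = np.random.default_rng(0)
x = rng.standard_normal((4, 32)).astype(np.float32)

x_1bit = fake_quantize(x, n_bits=1, axis=-1)
x_2bit = fake_quantize(x, n_bits=2, axis=-1)

err_1bit = float(np.mean((x - x_1bit) ** 2))
err_2bit = float(np.mean((x - x_2bit) ** 2))
print(f"MSE 1-bit: {err_1bit:.4f}, MSE 2-bit: {err_2bit:.4f}")
```

With 1 bit, every value in a group collapses to either that group's min or its max, so the reconstruction error is noticeably larger than at 2 bits. To estimate end-task accuracy, the same round-trip would be applied to the key and value tensors before attention, along whichever axis and group size you want to simulate.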
@zirui-ray-liu Sure, thank you.
Hi, thanks to the authors for presenting this great work. I am very interested in the performance of KIVI with 1-bit quantization. However, it does not seem to work. The errors are below. Any ideas about that? Many thanks.