IST-DASLab / QUIK

Repository for the QUIK project, enabling the use of 4-bit kernels for generative inference - EMNLP 2024
Apache License 2.0
171 stars 12 forks

add y in asy fused dequant #5

Closed xcwang1999 closed 1 year ago