IST-DASLab / QUIK

Repository for the QUIK project, enabling the use of 4bit kernels for generative inference - EMNLP 2024
Apache License 2.0
172 stars 12 forks source link