mit-han-lab / smoothquant

[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
https://arxiv.org/abs/2211.10438
MIT License
1.27k stars 150 forks source link

add llama model support #57

Open AniZpZ opened 1 year ago

AniZpZ commented 1 year ago

support llama model quant

MaddyThakker commented 1 week ago

Has anyone tried this?