issues
search
mit-han-lab
/
smoothquant
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
https://arxiv.org/abs/2211.10438
MIT License
1.27k
stars
150
forks
source link
add llama model support
#57
Open
AniZpZ
opened
1 year ago
AniZpZ
commented
1 year ago
support llama model quant
MaddyThakker
commented
1 week ago
Has anyone tried this?
support llama model quant