mit-han-lab / smoothquant

[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
https://arxiv.org/abs/2211.10438
MIT License
1.26k stars 150 forks source link

llama-2-chat demo #61

Closed liquanfeng closed 11 months ago

liquanfeng commented 1 year ago

PTAL, thanks! @Guangxuan-Xiao @tonylins