mit-han-lab/llm-awq
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
MIT License
2.08k stars · 150 forks
Add Mistral & Mixtral support
#174
Open
Sakits
opened
2 months ago