casper-hansen / AutoAWQ

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:
https://casper-hansen.github.io/AutoAWQ/
MIT License
1.69k stars 204 forks source link

What‘s the difference between llm-awq and this? #563

Open LiMa-cas opened 2 months ago

LiMa-cas commented 2 months ago

Hi,What‘s the difference between llm-awq and autoawq?thanks in advance!!!

casper-hansen commented 2 months ago

llm-awq was the original research artifact and has not been maintained. AutoAWQ is the easy way to run the AWQ algorithm on transformer models and is integrated with Huggingface's transformers. You can also use vLLM for easy inference as it is directly compatible with AutoAWQ.