Open Huyueeer opened 6 months ago
We have apply AWQ on Qwen models, as same as other LLMs
Qwen
same as other LLMs, we fail to apply awq in deepseek-v2 caused on unsupported module like mla(multi-latent attention)
We have apply AWQ on Qwen models, as same as other LLMs
Hi, how about the ppl of wikitext 2 ?
Any plans to support quantitative reasoning for qwen models?