AutoGPTQ / AutoGPTQ

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
MIT License
4.06k stars 416 forks source link

Add support for Gemma2 models. #700

Open markoarnauto opened 4 days ago

markoarnauto commented 4 days ago

Would be nice to have support for Gemma2.