casper-hansen / AutoAWQ

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:
https://casper-hansen.github.io/AutoAWQ/
MIT License

Support for JAIS #424

Open 7ossam81 opened 7 months ago

7ossam81 commented 7 months ago

Does AutoAWQ plan to support JAIS model quantization?

https://huggingface.co/core42/jais-30b-v3
https://huggingface.co/core42/jais-30b-chat-v3

beratcmn commented 6 months ago

Is there any news on this issue?