Closed jason-zou closed 1 week ago
If you are using a model in GPTQ format, you can specify -m gptq-auto to automatically detect the kernels and other quantization configurations. Check the usage section for more details.
-m gptq-auto
If you are using a model in GPTQ format, you can specify
-m gptq-auto
to automatically detect the kernels and other quantization configurations. Check the usage section for more details.