Hi
I tried the 13b version in TGI, and it works fine with bitsandbytes quantization.
With AWQ quantization in TGI, however, it fails with the error "Cannot load 'awq' weight, make sure the model is already quantized".
Is AWQ perhaps too new to be supported for this model when deploying with TGI?
Any suggestions or comments would be appreciated.
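For context, the error message suggests that TGI expects a checkpoint that was already quantized with AWQ, whereas bitsandbytes quantizes the weights at load time. Below is a rough sketch of how I would check that for a local checkpoint. It assumes the checkpoint's config.json carries a `quantization_config` entry with `quant_method` set to `awq`, which is my understanding of how transformers-style AWQ checkpoints are laid out; other AWQ exports may store this differently. The helper names are hypothetical and not part of TGI.

```python
import json
from pathlib import Path


def declares_awq(config: dict) -> bool:
    """Return True if a parsed config.json declares AWQ quantization.

    Assumption: the checkpoint stores its quantization settings under
    "quantization_config" with "quant_method" == "awq".
    """
    quant = config.get("quantization_config") or {}
    return str(quant.get("quant_method", "")).lower() == "awq"


def checkpoint_declares_awq(model_dir: str) -> bool:
    """Read <model_dir>/config.json and check it for an AWQ declaration."""
    config_path = Path(model_dir) / "config.json"
    if not config_path.is_file():
        # No config.json found, so nothing declares AWQ here.
        return False
    with config_path.open(encoding="utf-8") as f:
        return declares_awq(json.load(f))
```

If this returns False for the 13b checkpoint, the weights are probably the unquantized ones, which would be consistent with the error above.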
Thanks