Open yafehlis opened 2 months ago
INT8 quantization works fine, but INT4 does not work.
Yeah, int4 quantization doesn't work on AMD GPUs right now.
![Capture](https://github.com/pytorch-labs/gpt-fast/assets/106262476/ac10df53-860e-4da9-b51e-1ad17e3fe3c4)