Closed: wenhuach21 closed this 1 month ago
- w2g32 accuracy: verified, OK
- mixed bits: verified, OK
- issue: Triton kernel issue on low CUDA versions
Credit goes to @Qubitium (https://github.com/AutoGPTQ/AutoGPTQ/pull/640) and the GPTQ community.