We recently found that AutoGPTQ vendors its own versions of exllama and exllamav2 kernels in augotgptq_extension that are installed with the library. Since we install AutoGPTQ after we installed our own builds of the exllama kernels, the AutoGPTQ ones overwrite our copies. So it turns out that we don't need to vendor and compile our own exllama kernels.
Modifications
Removes the vendored copies of exllama kernels.
Result
There should be no functional changes other than faster build times and less code.
Motivation
We recently found that AutoGPTQ vendors its own versions of exllama and exllamav2 kernels in augotgptq_extension that are installed with the library. Since we install AutoGPTQ after we installed our own builds of the exllama kernels, the AutoGPTQ ones overwrite our copies. So it turns out that we don't need to vendor and compile our own exllama kernels.
Modifications
Removes the vendored copies of exllama kernels.
Result
There should be no functional changes other than faster build times and less code.