Closed by tjtanaa 7 months ago
Hi @fxmarty, the kernels in the v0.2.x ports were built on vllm-project#1313, with some modifications so that they build in our environments, plus the addition of the SqueezeLLM quantization kernels. Thank you.
I see, thank you!
Hi @tjtanaa, is the work being done in this repo different (kernel-wise) from https://github.com/vllm-project/vllm/pull/1313?
Thank you!