Open wenhuach21 opened 20 hours ago
@wenhuach21 It appears there are two issues.
We need your Linux OS version, kernel version, and libc/glibc version.
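A minimal sketch for collecting the details requested above, using only Python's standard `platform` module. The helper name `collect_system_info` is made up for illustration; on non-glibc systems `platform.libc_ver()` may return empty strings.

```python
import platform


def collect_system_info():
    """Gather the OS, kernel, and libc details useful for a bug report."""
    libc_name, libc_version = platform.libc_ver()
    return {
        "os": platform.platform(),        # distribution / platform string
        "kernel": platform.release(),     # kernel release, e.g. from uname -r
        "libc": f"{libc_name} {libc_version}".strip(),
        "python": platform.python_version(),
    }


if __name__ == "__main__":
    for key, value in collect_system_info().items():
        print(f"{key}: {value}")
```

Pasting the printed lines into the issue should cover the OS, kernel, and glibc questions in one go.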
Thanks. @CSY-ModelCloud
We have renamed gptqmodel_marlin_cuda_inference. Can you pull the latest code, delete the build directory, and then pip install it?
Got it. It would be beneficial for GPTQModel to provide a backward-compatible API for layer packing and repacking that accommodates both the original AutoGPTQ linear layer and your/AutoRound's fixed zero-point layer in future implementations. This would let the AutoRound side rely seamlessly on your CUDA kernels for Marlin, asymmetric quantization, and other operations.
We are adding hf_select_quant_linear as an external API for the HF/optimum repo. Can AutoRound use this? The API should become stable later today/tonight.
Tracking PR: https://github.com/ModelCloud/GPTQModel/pull/713
The code is not ready yet; we are still finalizing it. The PR above holds links to the HF/optimum PR that will be submitted upstream.
[1-3] https://github.com/ModelCloud/GPTQModel/pull/727/files
We will expose the 3 hf_-prefixed methods as a stable API to HF/optimum. There may still be changes; this is WIP.
Correction: 4 hf_ methods.
Thanks for the info. However, this may not help on our side: we need layer-wise packing and repacking, since AutoRound can support mixed bits or mixed group sizes.
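To make the mixed-bits point concrete, here is a rough sketch of what a per-layer lookup could look like. Everything in it is hypothetical: `LayerQuantConfig`, `resolve_layer_config`, the layer names, and the chosen bit widths are illustrative only and are not part of GPTQModel or AutoRound.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LayerQuantConfig:
    """Hypothetical per-layer quantization settings."""
    bits: int = 4
    group_size: int = 128
    sym: bool = True


def resolve_layer_config(layer_name, overrides, default=LayerQuantConfig()):
    """Return the override for this exact layer name, else the default."""
    return overrides.get(layer_name, default)


# Illustrative overrides: two layers deviate from the model-wide default.
overrides = {
    "model.layers.0.mlp.down_proj": LayerQuantConfig(bits=8, group_size=32),
    "lm_head": LayerQuantConfig(bits=4, group_size=64, sym=False),
}

for name in ("model.layers.0.mlp.down_proj",
             "model.layers.1.self_attn.q_proj",
             "lm_head"):
    cfg = resolve_layer_config(name, overrides)
    print(name, cfg.bits, cfg.group_size, cfg.sym)
```

A model-level kernel selection cannot express this, because each layer may need a different bits/group-size combination at pack time.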
When installing with pip, the Marlin kernel could not be loaded: ValueError: Trying to use the marlin backend, but could not import the C++/CUDA dependencies with the following error: /lib/x86_64-linux-gnu/libc.so.6: version `GLIBC_2.32' not found (required by /home/wenhuach/anaconda3/envs/autoround/lib/python3.10/site-packages/gptqmodel_marlin_cuda_inference.cpython-310-x86_64-linux-gnu.so)
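The error above says the compiled extension needs GLIBC_2.32, which the system libc does not provide. A quick way to confirm is to compare the installed glibc version against that requirement. This is a sketch using the standard library; `parse_version` and `glibc_satisfies` are hypothetical helper names.

```python
import platform


def parse_version(text):
    """Turn '2.32' into (2, 32) so versions compare numerically."""
    return tuple(int(part) for part in text.split(".") if part.isdigit())


def glibc_satisfies(required, installed=None):
    """Return True if the glibc version is at least `required`.

    If `installed` is omitted, it is detected via platform.libc_ver().
    Returns None when no glibc can be detected (e.g. musl or non-Linux).
    """
    if installed is None:
        name, installed = platform.libc_ver()
        if name != "glibc" or not installed:
            return None
    return parse_version(installed) >= parse_version(required)


if __name__ == "__main__":
    print("glibc >= 2.32:", glibc_satisfies("2.32"))
```

If this reports False, the prebuilt binary cannot load on that system, and building from source against the local glibc is the likely workaround.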
When installing from source