CUDA error - the provided PTX was compiled with an unsupported toolchain

marella / ctransformers

Python bindings for the Transformer models implemented in C/C++ using GGML library.

MIT License

1.76k stars 137 forks source link

In case someone else encounters the same issue, the problem was caused by having a nvcc version not compatible with the GPU driver version.
When installing with pip install ctransformers[cuda] precompiled libs for CUDA 12.2 are used, but in my cases I needed CUDA version 12.0. If I used CT_CUBLAS=1 pip install ctransformers --no-binary ctransformers by default the CUDA compiler path was /usr/bin/ which in my case had an older version of nvcc. The solution was to install the right CUDA version in a different path and then install ctransformers with: CMAKE_ARGS="-DCMAKE_CUDA_COMPILER=/path_to_cuda/bin/nvcc" CT_CUBLAS=1 pip install ctransformers --no-binary ctransformers

marella / ctransformers

CUDA error - the provided PTX was compiled with an unsupported toolchain #162