Open mohammad-qloo opened 2 months ago
I haven't tried to reproduce this, but there should be a new release of the Python bindings soon. In fact, it seems like it's only being held up by an issue with CI. Oh and also some PyPI limitations.
Can you try it again once that is available?
Also see PR #2802 and note specifically:
- Also search for CUDA 11 installed with pip at runtime since we now build against CUDA 11.8 anyway
For now it should be sufficient to pip install nvidia-cublas-cu12 nvidia-cuda-runtime-cu12
as long as your GPU driver is somewhat recent (at least 525.60.13 if we provide binary support for your GPU architecture, or 555.58 if we don't).
Support for CUDA 11 will be available in the next Python release (possibly 2.8.1).
I tried to run in CPU but getting cuda error
Bug Report
Running this in Linux but getting the following error.
It seems
_pyllmodel.py
has this block of code that is trying to import the cuda 12 files. I although have cuda 11.8 in the system I want to run it on CPU due to insufficient memory.Your Environment