Open aloksaurabh opened 9 months ago
Neither CUDA nor MPS are available - defaulting to CPU. Note: This module is much faster with a GPU.
Make sure you have CUDA installed and accessible.
I have a 40 GB model running on tabbyapi with multi-GPU on the same machine in another conda environment, so something else is wrong. For Polymind, in conda, after installing requirement.txt I still had to install a bunch of other packages, including onnxruntime-gpu. Could you share the conda package list?
There shouldn't be a need to install anything through conda; pip should be enough. You could try installing CUDA through conda, though.
This also seems to happen after manually installing the CUDA build of torch and onnxruntime-gpu. It looks like fast sentence transformer has issues with its GPU package.
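To narrow down which package is falling back to CPU, it may help to check what each runtime reports from inside the same environment Polymind runs in. The following is only a diagnostic sketch: `gpu_report` is a hypothetical helper name, and it assumes the two packages mentioned above (torch and onnxruntime-gpu, which is imported as `onnxruntime`) are the relevant ones.

```python
def gpu_report():
    """Collect what torch and onnxruntime report about GPU support."""
    report = {
        "torch_version": None,
        "torch_cuda_build": None,      # CUDA version the torch wheel was built for
        "torch_cuda_available": False,
        "ort_providers": None,         # execution providers onnxruntime can use
    }
    try:
        import torch
        report["torch_version"] = torch.__version__
        # torch.version.cuda is None on CPU-only wheels.
        report["torch_cuda_build"] = torch.version.cuda
        report["torch_cuda_available"] = torch.cuda.is_available()
    except (ImportError, OSError):
        pass  # torch missing or its native libraries failed to load
    try:
        import onnxruntime as ort
        report["ort_providers"] = ort.get_available_providers()
    except (ImportError, OSError):
        pass  # onnxruntime missing or its native libraries failed to load
    return report


if __name__ == "__main__":
    for key, value in gpu_report().items():
        print(f"{key}: {value}")
```

If `torch_cuda_build` is `None`, the installed torch wheel is a CPU-only build. If `ort_providers` does not list `CUDAExecutionProvider`, the CPU-only onnxruntime package is probably the one being imported.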
It seems this happens if you build in Docker: at build time the GPU is not available, so it chooses the wrong library. Any idea how to fix this?
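One possible direction for the Docker case, assuming the choice can be deferred until the container is actually running with the GPU attached: pick the onnxruntime execution provider at startup instead of at build time. This is a sketch, not a confirmed fix. `pick_providers` is a hypothetical helper, and whether Polymind or fast sentence transformer lets you pass providers through is an assumption.

```python
def pick_providers(available):
    """Return execution providers in preference order, GPU first.

    `available` is the list returned by onnxruntime.get_available_providers()
    at container start, not at image build time.
    """
    preferred = ["CUDAExecutionProvider", "CPUExecutionProvider"]
    chosen = [p for p in preferred if p in available]
    # Always fall back to CPU so session creation does not fail outright.
    return chosen or ["CPUExecutionProvider"]


# Example use at runtime (requires onnxruntime and a model file):
#   import onnxruntime as ort
#   providers = pick_providers(ort.get_available_providers())
#   session = ort.InferenceSession("model.onnx", providers=providers)
```

This only helps if onnxruntime-gpu is the package installed in the image; if the build step installed the CPU-only package, `CUDAExecutionProvider` will never appear in the list.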
Installed first
Still running on CPU
I have a model running on tabbyapi with multi gpu