Closed zhangdanfeng888 closed 2 months ago
How did you install TEI? Did you install it with cuda support? Do you have a supported GPU (cuda capability > 7.5)?
How did you install TEI? Did you install it with cuda support? Do you have a supported GPU (cuda capability > 7.5)?
I install TEI with your step CUDA: cargo install --path router -F candle-cuda-turing -F http --no-default-features, and the TEI is successfully installed, I think. My CUDA version is 12.4 and also add the nvidia binaries to path. My GPU is 3080, is it enough?
Use cargo install --path router -F candle-cuda -F http --no-default-features
instead.
Note the candle-cuda
instead of candle-cuda-turing
.
A 3080 is not a turing GPU but an Ampere one.
The candle-cuda-turing
feature should only be used for old GPUs like T4s.
Also, 1.3 had a bug for mistral. I strongly advise you to update either to latest or 1.4.
@OlivierDehaene Okay, I will try 1.4 via cargo install --path router -F candle-cuda -F http --no-default-features, thanks a lot
I run this: text-embeddings-router --model-id Salesforce/SFR-Embedding-Mistral --port 8080 --dtype float16
Then I get the following error: I have installed flash-attn:
The same error when I run: text-embeddings-router --model-id Salesforce/SFR-Embedding-2_R --port 8080 --dtype float16 What's wrong?