Open Semihal opened 4 months ago
I plan to use this container
I'm confused, do you want a container or a binary? If you want a container why not use the official one or the official command?
I'm confused, do you want a container or a binary?
I want to install TEI in a container image for future use.
If you want a container why not use the official one or the official command?
These are the instructions from the official Docker.
For clarity. The executable code looks exactly like this (from the official Docker image):
export CUDA_COMPUTE_CAP=86
export CUDA_HOME=/usr/local/cuda-12.1
export PATH=${PATH}:/usr/local/cuda-12.1/bin
# Limit parallelism
export CARGO_BUILD_JOBS=1
export RAYON_NUM_THREADS=1
export CARGO_BUILD_INCREMENTAL=true
if [ ${CUDA_COMPUTE_CAP} -ge 75 -a ${CUDA_COMPUTE_CAP} -lt 80 ];
then
nvprune \
--generate-code code=sm_${CUDA_COMPUTE_CAP} \
/usr/local/cuda/lib64/libcublas_static.a -o /usr/local/cuda/lib64/libcublas_static.a;
elif [ ${CUDA_COMPUTE_CAP} -ge 80 -a ${CUDA_COMPUTE_CAP} -lt 90 ];
then
nvprune \
--generate-code code=sm_80 \
--generate-code code=sm_${CUDA_COMPUTE_CAP} \
/usr/local/cuda/lib64/libcublas_static.a -o /usr/local/cuda/lib64/libcublas_static.a;
elif [ ${CUDA_COMPUTE_CAP} -eq 90 ];
then
nvprune \
--generate-code code=sm_90 \
/usr/local/cuda/lib64/libcublas_static.a -o /usr/local/cuda/lib64/libcublas_static.a;
else
echo "cuda compute cap ${CUDA_COMPUTE_CAP} is not supported"; exit 1;
fi;
if [ ${CUDA_COMPUTE_CAP} -ge 75 -a ${CUDA_COMPUTE_CAP} -lt 80 ];
then
cargo chef cook --release \
--features candle-cuda-turing \
--features static-linking \
--no-default-features \
--recipe-path recipe.json && \
sccache -s;
else
cargo chef cook --release \
--features candle-cuda \
--features static-linking \
--no-default-features \
--recipe-path recipe.json && \
sccache -s;
fi;
System Info
Version: v.1.4.0 Cargo version: cargo 1.79.0 (ffa9cf99a 2024-06-03) GCC version: 11.4.1 GPU: Compile with CUDA_COMPUTE_CAP=86 on machine without GPU (but with CUDA 12.1). I plan to use this container with A40, but I don't have a GPU to build it.
Information
Tasks
Reproduction
I start this script:
I get this error:
Expected behavior
TEI compiled.