Closed lstocchi closed 3 weeks ago
The problem seems to be that i don't have a NVIDIA gpu and the intel one that i have it is not supported by CUDA. If i disable cuda by updating the containerfile
ENV CMAKE_ARGS="-DLLAMA_CUBLAS=off"
ENV FORCE_CMAKE=0
it works but it is extremely slow. I guess we need a different base image based on the gpu the user is using
Closing obsolete
When running the x86 model_service image you face this error