nomic-ai / gpt4all

GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
https://nomic.ai/gpt4all
MIT License

Python Gpt4all, Docker Linux Container with Cuda 12, complains about missing Cuda 11 and uses CPU only #3033

Open · Crischan opened 1 week ago

Crischan commented 1 week ago

Bug Report

Hi, using a Docker container with CUDA 12 on Ubuntu 22.04, the NVIDIA GeForce 3060 works with LangChain (e.g. when running a local model), but the LangChain GPT4AllEmbeddings functions raise a warning and fall back to CPU only:

Failed to load libllamamodel-mainline-cuda.so: dlopen: libcudart.so.11.0: cannot open shared object file: No such file or directory
Failed to load libllamamodel-mainline-cuda-avxonly.so: dlopen: libcudart.so.11.0: cannot open shared object file: No such file or directory
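
For diagnosis, the failing lookup can be reproduced outside of GPT4All with a small ctypes sketch that shows which CUDA runtime sonames the dynamic loader can resolve inside the container (libcudart.so.12 is an assumed name for the CUDA 12 runtime):

from ctypes import CDLL

# Probe the soname from the error above plus the assumed CUDA 12 runtime
# soname, and report which ones the dynamic loader can resolve.
for soname in ("libcudart.so.11.0", "libcudart.so.12"):
    try:
        CDLL(soname)
        print(f"{soname}: found")
    except OSError as exc:
        print(f"{soname}: not found ({exc})")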

What is the typical Linux setup for CUDA 12, and which packages need to be present so that GPT4All recognizes the CUDA 12 functionality instead of searching for CUDA 11?

Thank you.

Example Code

from langchain_chroma import Chroma
from langchain_community.embeddings import GPT4AllEmbeddings
...
model_name = "nomic-embed-text-v1.5.f16.gguf"  # or "all-MiniLM-L6-v2.gguf2.f16.gguf"
gpt4all_kwargs = {"allow_download": True}
embeddings = GPT4AllEmbeddings(model_name=model_name, gpt4all_kwargs=gpt4all_kwargs)
...
vectorstore = Chroma(
    collection_name="rag-embeddings",
    persist_directory=persist_directory,
    embedding_function=embeddings,
)
vectorstore.reset_collection()
...
vectorstore.add_documents(documents=[some_documents], ids=[some_ids])

Steps to Reproduce

  1. Docker container, image nvidia/cuda:12.6.1-devel-ubuntu22.04, with GPU enabled and working
  2. Use the GPT4All Python / LangChain methods as stated above (a quick GPU visibility check is sketched below)
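
To confirm that the GPT4All backend itself can see the GPU, a quick check is sketched below; it assumes your installed gpt4all version exposes the list_gpus helper on the GPT4All class:

from gpt4all import GPT4All

# Assumption: recent gpt4all Python bindings provide GPT4All.list_gpus(),
# which returns the GPU devices the backend can use; an empty result or an
# error here matches the CPU-only fallback described above.
print(GPT4All.list_gpus())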

Expected Behavior

There should be no warnings; CUDA 12 should be recognized and used by GPT4All.

Your Environment

mbarbe commented 16 hours ago

In the Dockerfile, after installing CUDA and the NVIDIA driver, try installing the CUDA runtime libraries via the gpt4all extra: pip install "gpt4all[cuda]" (a minimal Dockerfile sketch is below).
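
A minimal Dockerfile sketch of that suggestion, assuming the base image from the reproduction steps (the [cuda] extra pulls in the NVIDIA runtime wheels that the prebuilt GPT4All backend links against):

FROM nvidia/cuda:12.6.1-devel-ubuntu22.04
# Install Python in case the base image does not ship it.
RUN apt-get update && apt-get install -y python3 python3-pip
# The [cuda] extra installs the CUDA runtime libraries the prebuilt
# gpt4all backend was linked against, resolving the libcudart lookup.
RUN pip3 install "gpt4all[cuda]"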

Also, when loading the model, specify the device, something like this:

from gpt4all import GPT4All
model = GPT4All("Meta-Llama-3-8B-Instruct.Q4_0.gguf", device="cuda")
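
For the embeddings path from the original report, the same idea would be forwarding the device through gpt4all_kwargs; this assumes GPT4AllEmbeddings hands those kwargs to Embed4All and that Embed4All accepts a device argument like GPT4All does:

from langchain_community.embeddings import GPT4AllEmbeddings

# Assumption: gpt4all_kwargs is forwarded to Embed4All, which accepts
# a device argument the same way GPT4All does.
embeddings = GPT4AllEmbeddings(
    model_name="nomic-embed-text-v1.5.f16.gguf",
    gpt4all_kwargs={"allow_download": True, "device": "cuda"},
)
print(len(embeddings.embed_query("hello")))  # should run on the GPU backend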