Open gfrankliu opened 2 weeks ago
# Docker comes with Debian 12
gfrankliu-t4-ws ➜ ~ docker version
Client:
Version: 20.10.24+dfsg1
API version: 1.41
Go version: go1.19.8
Git commit: 297e128
Built: Thu May 18 08:38:34 2023
OS/Arch: linux/amd64
Context: default
Experimental: true
Server:
Engine:
Version: 20.10.24+dfsg1
API version: 1.41 (minimum version 1.12)
Go version: go1.19.8
Git commit: 5d6db84
Built: Thu May 18 08:38:34 2023
OS/Arch: linux/amd64
Experimental: false
containerd:
Version: 1.6.20~ds1
GitCommit: 1.6.20~ds1-1+b1
runc:
Version: 1.1.5+ds1
GitCommit: 1.1.5+ds1-1+deb12u1
docker-init:
Version: 0.19.0
GitCommit:
gfrankliu-t4-ws ➜ ~ nvidia-container-cli --version
cli-version: 1.16.1
lib-version: 1.16.1
build date: 2024-07-23T14:57+00:00
build revision: 4c2494f16573b585788a42e9c7bee76ecd48c73d
build compiler: x86_64-linux-gnu-gcc-7 7.5.0
build platform: x86_64
build flags: -D_GNU_SOURCE -D_FORTIFY_SOURCE=2 -DNDEBUG -std=gnu11 -O2 -g -fdata-sections -ffunction-sections -fplan9-extensions -fstack-protector -fno-strict-aliasing -fvisibility=hidden -Wall -Wextra -Wcast-align -Wpointer-arith -Wmissing-prototypes -Wnonnull -Wwrite-strings -Wlogical-op -Wformat=2 -Wmissing-format-attribute -Winit-self -Wshadow -Wstrict-prototypes -Wunreachable-code -Wconversion -Wsign-conversion -Wno-unknown-warning-option -Wno-format-extra-args -Wno-gnu-alignof-expression -Wl,-zrelro -Wl,-znow -Wl,-zdefs -Wl,--gc-sections
gfrankliu-t4-ws ➜ ~
I am using the VM (Debian 12) in the GCP Cloud with GPU attached.
I have docker (default installation from Debian 12) on the VM, and installed nvidia container toolkit:
I am testing the docker but it can't find nvidia-smi:
It seems the nvidia container toolkit doesn't like it when cloud has nvidia driver in /var/lib/nvidia. If I manually volume mount the /var/lib/nvidia from the host into the container, it will work:
How can I tell nvidia-container-toolkit to automatically mount nvidia-smi into the container? The toolkit seems to only like nvidia-smi in the /usr/bin. eg: if I manually copy binaries from /var/lib/nvidia/bin to /usr/bin, I can then see nvidia-smi mounted into the container: