bryanbocao / awesome-cmd-cheatsheets

Quick search the commands you need.
3 stars 1 forks source link

could not select device driver "" with capabilities: [[gpu]] #3

Closed bryanbocao closed 2 years ago

bryanbocao commented 2 years ago
docker run --ipc=host --shm-size=16384m -it -v /:/share --gpus all --network=bridge <image_is> /bin/bash
docker: Error response from daemon: could not select device driver "" with capabilities: [[gpu]].
ERRO[0002] error waiting for container: context canceled
bryanbocao commented 2 years ago

Follow the instructions in https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/install-guide.html#docker

E: Unable to locate package nvidia-docker2
bryanbocao commented 2 years ago

Tried

https://github.com/NVIDIA/nvidia-docker/issues/953#issuecomment-482452526

curl -s -L https://nvidia.github.io/nvidia-docker/gpgkey | \
  sudo apt-key add -
distribution=$(. /etc/os-release;echo $ID$VERSION_ID)
curl -s -L https://nvidia.github.io/nvidia-docker/$distribution/nvidia-docker.list | \
  sudo tee /etc/apt/sources.list.d/nvidia-docker.list
sudo apt-get update

# Install nvidia-docker2 and reload the Docker daemon configuration
sudo apt-get install -y nvidia-docker2
sudo pkill -SIGHUP dockerd

bryanbocao commented 1 year ago

The following commands are sufficient to resolve this issue: https://github.com/NVIDIA/nvidia-docker/issues/887#issuecomment-447829585

sudo apt-get remove docker docker-engine docker.io

sudo apt-get install \
    apt-transport-https \
    ca-certificates \
    curl \
    software-properties-common

curl -fsSL https://download.docker.com/linux/ubuntu/gpg | sudo apt-key add -
sudo apt-key fingerprint 0EBFCD88

sudo add-apt-repository \
   "deb [arch=amd64] https://download.docker.com/linux/ubuntu \
   $(lsb_release -cs) \
   stable"

sudo apt-get update
sudo apt-get install docker-ce
sudo apt-get install nvidia-docker2

And finally https://www.cnblogs.com/dan-baishucaizi/p/15102419.html

distribution=$(. /etc/os-release;echo $ID$VERSION_ID)
curl -s -L https://nvidia.github.io/nvidia-docker/gpgkey | sudo apt-key add -
curl -s -L https://nvidia.github.io/nvidia-docker/$distribution/nvidia-docker.repo | sudo tee /etc/yum.repos.d/nvidia-docker.repo
curl -s -L https://nvidia.github.io/nvidia-docker/$distribution/nvidia-docker.list | sudo tee /etc/apt/sources.list.d/nvidia-docker.list
sudo apt-get update && sudo apt-get install -y nvidia-container-toolkit
sudo systemctl restart docker

Then it's solved. If it failed at the first time, try the instructions in this section again. My second trial worked.