Open luckyyyyy opened 1 year ago
# 屏蔽nouveau 添加一句 blacklist nouveau vim /etc/modprobe.d/blacklist.conf # 修改生效 update-initramfs -u # 重启 reboot # 到nvidia官方下载对应驱动 给运行权限 chmod +x NVIDIA-Linux-x86_64-525.116.04.run # 安装 ./NVIDIA-Linux-x86_64-525.116.04.run
apt-get install gcc apt-get install make
apt-get purge nvidia* apt-get autoremove reboot
添加两行到 /etc/modules-load.d/nvidia.conf
nvidia nvidia-uvm
新建 /etc/udev/rules.d/70-nvidia.rules 添加内容
# /etc/udev/rules.d/70-nvidia.rules # Create /nvidia0, /dev/nvidia1 and /nvidiactl when nvidia module is loaded KERNEL=="nvidia", RUN+="/bin/bash -c '/usr/bin/nvidia-smi -L && /bin/chmod 666 /dev/nvidia*'" # Create the CUDA node when nvidia_uvm CUDA module is loaded KERNEL=="nvidia_uvm", RUN+="/bin/bash -c '/usr/bin/nvidia-modprobe -c0 -u && /bin/chmod 0666 /dev/nvidia-uvm*'"
重启
参考如下 使用cgroup2添加对应设备
lxc.apparmor.profile: unconfined lxc.cgroup.devices.allow: a lxc.cap.drop: lxc.cgroup2.devices.allow: c 10:200 rwm lxc.mount.entry: /dev/net/tun dev/net/tun none bind,create=file lxc.cgroup2.devices.allow: c 195:* rwm lxc.cgroup2.devices.allow: c 226:* rwm lxc.cgroup2.devices.allow: c 507:* rwm lxc.mount.entry: /dev/nvidia0 dev/nvidia0 none bind,optional,create=file lxc.mount.entry: /dev/nvidiactl dev/nvidiactl none bind,optional,create=file lxc.mount.entry: /dev/nvidia-modeset dev/nvidia-modeset none bind,optional,create=file lxc.mount.entry: /dev/nvidia-uvm dev/nvidia-uvm none bind,optional,create=file lxc.mount.entry: /dev/nvidia-uvm-tools dev/nvidia-uvm-tools none bind,optional,create=file lxc.mount.entry: /dev/dri dev/dri none bind,optional,create=dir
需要安装显卡驱动,选择 ./NVIDIA-Linux-x86_64-535.104.05.run --no-kernel-module 方式安装
./NVIDIA-Linux-x86_64-535.104.05.run --no-kernel-module
参考NVIDIA官方的安装手册 https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/install-guide.html
基本操作
错误
解决
安装后
添加两行到 /etc/modules-load.d/nvidia.conf
添加规则
新建 /etc/udev/rules.d/70-nvidia.rules 添加内容
重启
LXC配置
参考如下 使用cgroup2添加对应设备
LXC
需要安装显卡驱动,选择
./NVIDIA-Linux-x86_64-535.104.05.run --no-kernel-module
方式安装LXC Docker
参考NVIDIA官方的安装手册 https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/install-guide.html