NVIDIA / nvidia-container-toolkit


Unable to use sudo apt-get - new issue #170

Open michaelklachko opened 2 years ago

michaelklachko commented 2 years ago

1. Issue or feature description

sudo apt-get update                                                                                                                                     
E: Conflicting values set for option Signed-By regarding source https://nvidia.github.io/libnvidia-container/stable/ubuntu18.04/amd64/ /: /usr/share/keyrings/nvidia-container-toolkit-keyring.gpg !=  
E: The list of sources could not be read. 

Similar issue to https://github.com/NVIDIA/nvidia-docker/issues/1626 but the solution suggested there does not apply in my case.

2. Steps to reproduce the issue

It started when I ran https://github.com/mlcommons/training/blob/master/install_cuda_docker.sh, which failed at the sudo apt-get install -y nvidia-docker2 step:

Reading package lists... Done                                                                                                                                                                          
W: Target Packages (main/binary-amd64/Packages) is configured multiple times in /etc/apt/sources.list:57 and /etc/apt/sources.list.d/lambda-repository.list:1  
W: Target Packages (main/binary-i386/Packages) is configured multiple times in /etc/apt/sources.list:57 and /etc/apt/sources.list.d/lambda-repository.list:1   
W: Target Packages (main/binary-all/Packages) is configured multiple times in /etc/apt/sources.list:57 and /etc/apt/sources.list.d/lambda-repository.list:1    
W: Target Translations (main/i18n/Translation-en) is configured multiple times in /etc/apt/sources.list:57 and /etc/apt/sources.list.d/lambda-repository.list:1 
W: Target CNF (main/cnf/Commands-amd64) is configured multiple times in /etc/apt/sources.list:57 and /etc/apt/sources.list.d/lambda-repository.list:1     
W: Target CNF (main/cnf/Commands-all) is configured multiple times in /etc/apt/sources.list:57 and /etc/apt/sources.list.d/lambda-repository.list:1      
W: Target Packages (main/binary-amd64/Packages) is configured multiple times in /etc/apt/sources.list:57 and /etc/apt/sources.list.d/lambda-repository.list:1  
W: Target Packages (main/binary-i386/Packages) is configured multiple times in /etc/apt/sources.list:57 and /etc/apt/sources.list.d/lambda-repository.list:1 
W: Target Packages (main/binary-all/Packages) is configured multiple times in /etc/apt/sources.list:57 and /etc/apt/sources.list.d/lambda-repository.list:1  
W: Target Translations (main/i18n/Translation-en) is configured multiple times in /etc/apt/sources.list:57 and /etc/apt/sources.list.d/lambda-repository.list:1  
W: Target CNF (main/cnf/Commands-amd64) is configured multiple times in /etc/apt/sources.list:57 and /etc/apt/sources.list.d/lambda-repository.list:1            
W: Target CNF (main/cnf/Commands-all) is configured multiple times in /etc/apt/sources.list:57 and /etc/apt/sources.list.d/lambda-repository.list:1  

Reading package lists... Done  
Building dependency tree       
Reading state information... Done   
Some packages could not be installed. This may mean that you have   
requested an impossible situation or if you are using the unstable   
distribution that some required packages have not yet been created  
or been moved out of Incoming.                                                                                                                                                                         
The following information may help to resolve the situation:  
The following packages have unmet dependencies:          
nvidia-docker2 : Depends: nvidia-container-toolkit (>= 1.9.0-1) but 1.7.0+dfsg-0lambda0.20.04.1 is to be installed  
E: Unable to correct problems, you have held broken packages. 
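
To make the version mismatch in the error above concrete, here is a minimal sketch that compares the candidate version with the required one. It assumes GNU `sort -V` is available, which only approximates dpkg's ordering (`dpkg --compare-versions` is the authoritative check on a Debian or Ubuntu system). The helper name `version_at_least` is hypothetical.

```shell
#!/bin/sh
# version_at_least HAVE WANT
# Succeeds (exit 0) when HAVE sorts at or after WANT under "sort -V".
# This is GNU version sort, used here as an approximation of dpkg's rules.
version_at_least() {
    have=$1
    want=$2
    # The smaller of the two versions ends up on the first line.
    lowest=$(printf '%s\n%s\n' "$have" "$want" | sort -V | head -n 1)
    [ "$lowest" = "$want" ]
}

# Versions copied from the apt error message above.
if version_at_least "1.7.0+dfsg-0lambda0.20.04.1" "1.9.0-1"; then
    echo "candidate satisfies >= 1.9.0-1"
else
    echo "candidate is older than 1.9.0-1"
fi
```

Running `apt-cache policy nvidia-container-toolkit` on the affected machine should show which repository supplies the candidate version.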

After this I ran (copied from https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/install-guide.html#docker):

distribution=$(. /etc/os-release;echo $ID$VERSION_ID) \
      && curl -fsSL https://nvidia.github.io/libnvidia-container/gpgkey | sudo gpg --dearmor -o /usr/share/keyrings/nvidia-container-toolkit-keyring.gpg \
      && curl -s -L https://nvidia.github.io/libnvidia-container/$distribution/libnvidia-container.list | \
            sed 's#deb https://#deb [signed-by=/usr/share/keyrings/nvidia-container-toolkit-keyring.gpg] https://#g' | \
            sudo tee /etc/apt/sources.list.d/nvidia-container-toolkit.list

and now I'm getting:

sudo apt-get update                                                                                                                                     
E: Conflicting values set for option Signed-By regarding source https://nvidia.github.io/libnvidia-container/stable/ubuntu18.04/amd64/ /: /usr/share/keyrings/nvidia-container-toolkit-keyring.gpg !=  
E: The list of sources could not be read.

ll /etc/apt/sources.list.d
total 32
drwxr-xr-x 2 root root 4096 Apr 14 09:36 ./        
drwxr-xr-x 7 root root 4096 Apr 14 09:29 ../ 
-rw-r--r-- 1 root root   50 Jan 10 22:07 cuda-ubuntu2004-11-6-local.list  
-rw-r--r-- 1 root root  130 Apr 14 07:52 docker.list 
-rw-r--r-- 1 root root  110 Feb  2 15:30 lambda-repository.list    
-rw-r--r-- 1 root root  301 Apr 14 08:06 nvidia-container-toolkit.list  
-rw-r--r-- 1 root root  401 Apr 14 08:07 nvidia-docker.list 
-rw-r--r-- 1 root root   56 Jan  3 09:44 pkgs_tailscale_com_stable_ubuntu.list

cat /etc/apt/sources.list.d/nvidia-container-toolkit.list
deb [signed-by=/usr/share/keyrings/nvidia-container-toolkit-keyring.gpg] https://nvidia.github.io/libnvidia-container/stable/ubuntu18.04/$(ARCH) /       
#deb [signed-by=/usr/share/keyrings/nvidia-container-toolkit-keyring.gpg] https://nvidia.github.io/libnvidia-container/experimental/ubuntu18.04/$(ARCH) / 
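
apt reports "Conflicting values set for option Signed-By" when the same source is configured more than once with different signed-by values. A minimal sketch for listing every active entry that mentions a given repository, so the entries can be compared side by side, might look like this. It assumes the conflict comes from a second file (possibly nvidia-docker.list in the listing above, whose contents are not shown here) pointing at the same URL without the signed-by option. The function name `list_repo_entries` is hypothetical.

```shell
#!/bin/sh
# list_repo_entries DIR PATTERN
# Print every active (non-comment) "deb" line under DIR that contains PATTERN,
# prefixed with "file:line:", so entries for the same repository that carry
# different [signed-by=...] options can be spotted.
list_repo_entries() {
    dir=$1
    pattern=$2
    # -r recurse, -n line numbers, -H always print file names, -F fixed string.
    # The second grep drops commented-out lines such as "#deb ...".
    grep -rnHF -- "$pattern" "$dir" |
        grep -E '^[^:]+:[0-9]+:[[:space:]]*deb[[:space:]]'
}

# Example (not executed here):
#   list_repo_entries /etc/apt/sources.list.d nvidia.github.io/libnvidia-container
```

If two files show the same URL and only one of them has the signed-by option, making them agree (or removing the duplicate entry) is the usual way out, though whether that applies here is an assumption.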

3. Information to attach (optional if deemed irrelevant)

nvidia-container-cli -k -d /dev/tty info

-- WARNING, the following logs are for debugging purposes only --

I0414 16:56:00.265530 171673 nvc.c:372] initializing library context (version=1.7.0, build=)                                                                                                           
I0414 16:56:00.265610 171673 nvc.c:346] using root /                                                                                                                                                   
I0414 16:56:00.265617 171673 nvc.c:347] using ldcache /etc/ld.so.cache                                                                                                                                 
I0414 16:56:00.265623 171673 nvc.c:348] using unprivileged user 1009:1009                                                                                                                              
I0414 16:56:00.265645 171673 nvc.c:389] attempting to load dxcore to see if we are running under Windows Subsystem for Linux (WSL)                                                                     

I0414 16:56:00.265802 171673 nvc.c:391] dxcore initialization failed, continuing assuming a non-WSL environment                                                                                        
W0414 16:56:00.330395 171674 nvc.c:269] failed to set inheritable capabilities                                                                                                                         
W0414 16:56:00.330440 171674 nvc.c:270] skipping kernel modules load due to failure                                                                                                                    
I0414 16:56:00.330704 171675 driver.c:101] starting driver service                                                                                                                                     
I0414 16:56:00.337459 171673 nvc_info.c:758] requesting driver information with ''                                                                                                                     
I0414 16:56:00.338714 171673 nvc_info.c:171] selecting /usr/lib/x86_64-linux-gnu/vdpau/libvdpau_nvidia.so.510.47.03                                                                                    
I0414 16:56:00.338859 171673 nvc_info.c:171] selecting /usr/lib/x86_64-linux-gnu/libnvoptix.so.510.47.03                                                                                               
I0414 16:56:00.338894 171673 nvc_info.c:171] selecting /usr/lib/x86_64-linux-gnu/libnvidia-tls.so.510.47.03                                                                                            
I0414 16:56:00.338915 171673 nvc_info.c:171] selecting /usr/lib/x86_64-linux-gnu/libnvidia-rtcore.so.510.47.03                                                                                         
I0414 16:56:00.338937 171673 nvc_info.c:171] selecting /usr/lib/x86_64-linux-gnu/libnvidia-ptxjitcompiler.so.510.47.03                                                                                 
I0414 16:56:00.338973 171673 nvc_info.c:171] selecting /usr/lib/x86_64-linux-gnu/libnvidia-opticalflow.so.510.47.03                                                                                    
I0414 16:56:00.339008 171673 nvc_info.c:171] selecting /usr/lib/x86_64-linux-gnu/libnvidia-opencl.so.510.47.03                                                                                         
I0414 16:56:00.339040 171673 nvc_info.c:171] selecting /usr/lib/x86_64-linux-gnu/libnvidia-ngx.so.510.47.03                                                                                            
I0414 16:56:00.339060 171673 nvc_info.c:171] selecting /usr/lib/x86_64-linux-gnu/libnvidia-ml.so.510.47.03                                                                                             
I0414 16:56:00.339100 171673 nvc_info.c:171] selecting /usr/lib/x86_64-linux-gnu/libnvidia-glvkspirv.so.510.47.03                                                                                      
I0414 16:56:00.339122 171673 nvc_info.c:171] selecting /usr/lib/x86_64-linux-gnu/libnvidia-glsi.so.510.47.03                                                                                           
I0414 16:56:00.339142 171673 nvc_info.c:171] selecting /usr/lib/x86_64-linux-gnu/libnvidia-glcore.so.510.47.03                                                                                         
I0414 16:56:00.339167 171673 nvc_info.c:171] selecting /usr/lib/x86_64-linux-gnu/libnvidia-fbc.so.510.47.03                                                                                            
I0414 16:56:00.339202 171673 nvc_info.c:171] selecting /usr/lib/x86_64-linux-gnu/libnvidia-encode.so.510.47.03                                                                                         
I0414 16:56:00.339235 171673 nvc_info.c:171] selecting /usr/lib/x86_64-linux-gnu/libnvidia-eglcore.so.510.47.03                                                                                        
I0414 16:56:00.339257 171673 nvc_info.c:171] selecting /usr/lib/x86_64-linux-gnu/libnvidia-compiler.so.510.47.03                                                                                       
I0414 16:56:00.339278 171673 nvc_info.c:171] selecting /usr/lib/x86_64-linux-gnu/libnvidia-cfg.so.510.47.03                                                                                            
I0414 16:56:00.339311 171673 nvc_info.c:171] selecting /usr/lib/x86_64-linux-gnu/libnvidia-allocator.so.510.47.03                                                                                      
I0414 16:56:00.339346 171673 nvc_info.c:171] selecting /usr/lib/x86_64-linux-gnu/libnvcuvid.so.510.47.03                                                                                               
I0414 16:56:00.339577 171673 nvc_info.c:171] selecting /usr/lib/x86_64-linux-gnu/libcuda.so.510.47.03                                                                                                  
I0414 16:56:00.339674 171673 nvc_info.c:171] selecting /usr/lib/x86_64-linux-gnu/libGLX_nvidia.so.510.47.03                                                                                            
I0414 16:56:00.339698 171673 nvc_info.c:171] selecting /usr/lib/x86_64-linux-gnu/libGLESv2_nvidia.so.510.47.03                                                                                         
I0414 16:56:00.339721 171673 nvc_info.c:171] selecting /usr/lib/x86_64-linux-gnu/libGLESv1_CM_nvidia.so.510.47.03                                                                                      
I0414 16:56:00.339744 171673 nvc_info.c:171] selecting /usr/lib/x86_64-linux-gnu/libEGL_nvidia.so.510.47.03                                                                                            
W0414 16:56:00.339794 171673 nvc_info.c:397] missing library libnvidia-nscq.so                                                                                                                         
W0414 16:56:00.339800 171673 nvc_info.c:397] missing library libnvidia-fatbinaryloader.so                                                                                                              
W0414 16:56:00.339805 171673 nvc_info.c:397] missing library libnvidia-ifr.so                                                                                                                          
W0414 16:56:00.339812 171673 nvc_info.c:397] missing library libnvidia-cbl.so                                                                                                                          
W0414 16:56:00.339817 171673 nvc_info.c:401] missing compat32 library libnvidia-ml.so                                                                                                                  
W0414 16:56:00.339823 171673 nvc_info.c:401] missing compat32 library libnvidia-cfg.so                                                                                                                 
W0414 16:56:00.339828 171673 nvc_info.c:401] missing compat32 library libnvidia-nscq.so                                                                                                                
W0414 16:56:00.339833 171673 nvc_info.c:401] missing compat32 library libcuda.so                                                                                                                       
W0414 16:56:00.339837 171673 nvc_info.c:401] missing compat32 library libnvidia-opencl.so                                                                                                              
W0414 16:56:00.339842 171673 nvc_info.c:401] missing compat32 library libnvidia-ptxjitcompiler.so                                                                                                      
W0414 16:56:00.339848 171673 nvc_info.c:401] missing compat32 library libnvidia-fatbinaryloader.so                                                                                                     
W0414 16:56:00.339853 171673 nvc_info.c:401] missing compat32 library libnvidia-allocator.so                                                                                                           
W0414 16:56:00.339857 171673 nvc_info.c:401] missing compat32 library libnvidia-compiler.so                                                                                                            
W0414 16:56:00.339862 171673 nvc_info.c:401] missing compat32 library libnvidia-ngx.so                                                                                                                 
W0414 16:56:00.339867 171673 nvc_info.c:401] missing compat32 library libvdpau_nvidia.so                                                                                                               
W0414 16:56:00.339872 171673 nvc_info.c:401] missing compat32 library libnvidia-encode.so                                                                                                                     
W0414 16:56:00.339877 171673 nvc_info.c:401] missing compat32 library libnvidia-opticalflow.so                                                                                                         
W0414 16:56:00.339884 171673 nvc_info.c:401] missing compat32 library libnvcuvid.so                                                                                                                    
W0414 16:56:00.339889 171673 nvc_info.c:401] missing compat32 library libnvidia-eglcore.so                                                                                                             
W0414 16:56:00.339895 171673 nvc_info.c:401] missing compat32 library libnvidia-glcore.so                                                                                                              
W0414 16:56:00.339902 171673 nvc_info.c:401] missing compat32 library libnvidia-tls.so                                                                                                                 
W0414 16:56:00.339907 171673 nvc_info.c:401] missing compat32 library libnvidia-glsi.so                                                                                                                
W0414 16:56:00.339912 171673 nvc_info.c:401] missing compat32 library libnvidia-fbc.so                                                                                                                 
W0414 16:56:00.339918 171673 nvc_info.c:401] missing compat32 library libnvidia-ifr.so                                                                                                                 
W0414 16:56:00.339923 171673 nvc_info.c:401] missing compat32 library libnvidia-rtcore.so                                                                                                              
W0414 16:56:00.339928 171673 nvc_info.c:401] missing compat32 library libnvoptix.so                                                                                                                    
W0414 16:56:00.339934 171673 nvc_info.c:401] missing compat32 library libGLX_nvidia.so                                                                                                                 
W0414 16:56:00.339940 171673 nvc_info.c:401] missing compat32 library libEGL_nvidia.so                                                                                                                 
W0414 16:56:00.339946 171673 nvc_info.c:401] missing compat32 library libGLESv2_nvidia.so                                                                                                              
W0414 16:56:00.339953 171673 nvc_info.c:401] missing compat32 library libGLESv1_CM_nvidia.so                                                                                                           
W0414 16:56:00.339958 171673 nvc_info.c:401] missing compat32 library libnvidia-glvkspirv.so                                                                                                           
W0414 16:56:00.339963 171673 nvc_info.c:401] missing compat32 library libnvidia-cbl.so                                                                                                                 
I0414 16:56:00.340373 171673 nvc_info.c:297] selecting /usr/bin/nvidia-smi                                                                                                                             
I0414 16:56:00.340392 171673 nvc_info.c:297] selecting /usr/bin/nvidia-debugdump                                                                                                                       
I0414 16:56:00.340407 171673 nvc_info.c:297] selecting /usr/bin/nvidia-persistenced                                                                                                                    
I0414 16:56:00.340441 171673 nvc_info.c:297] selecting /usr/bin/nvidia-cuda-mps-control                                                                                                                
I0414 16:56:00.340457 171673 nvc_info.c:297] selecting /usr/bin/nvidia-cuda-mps-server                                                                                                                 
W0414 16:56:00.340495 171673 nvc_info.c:423] missing binary nv-fabricmanager                                                                                                                           
I0414 16:56:00.340519 171673 nvc_info.c:341] listing firmware path /usr/lib/firmware/nvidia/510.47.03                                                                                                  
I0414 16:56:00.340544 171673 nvc_info.c:520] listing device /dev/nvidiactl                                                                                                                             
I0414 16:56:00.340549 171673 nvc_info.c:520] listing device /dev/nvidia-uvm                                                                                                                            
I0414 16:56:00.340554 171673 nvc_info.c:520] listing device /dev/nvidia-uvm-tools                                                                                                                      
I0414 16:56:00.340561 171673 nvc_info.c:520] listing device /dev/nvidia-modeset                                                                                                                        
I0414 16:56:00.340591 171673 nvc_info.c:341] listing ipc path /run/nvidia-persistenced/socket                                                                                                          
W0414 16:56:00.340609 171673 nvc_info.c:347] missing ipc path /var/run/nvidia-fabricmanager/socket                                                                                                     
W0414 16:56:00.340623 171673 nvc_info.c:347] missing ipc path /tmp/nvidia-mps                                                                                                                          
I0414 16:56:00.340628 171673 nvc_info.c:814] requesting device information with ''                                                                                                                     
I0414 16:56:00.348100 171673 nvc_info.c:705] listing device /dev/nvidia0 (GPU-fcc8f71e-ab7d-5d4c-7bdf-9e7f2af89aa4 at 00000000:03:00.0)
I0414 16:56:00.355306 171673 nvc_info.c:705] listing device /dev/nvidia1 (GPU-d49522ac-ffe6-9b4d-12bb-f93ae70c12f1 at 00000000:04:00.0)
NVRM version:   510.47.03                                                                                                                                                                              
CUDA version:   11.6

Device Index:   0
Device Minor:   0                                                                                                                                                                                      
Model:          NVIDIA A40                                                                                                                                                                             
Brand:          Unknown                                                                                                                                                                                
GPU UUID:       GPU-fcc8f71e-ab7d-5d4c-7bdf-9e7f2af89aa4                                                                                                                                               
Bus Location:   00000000:03:00.0                                                                                                                                                                       
Architecture:   8.6

Device Index:   1
Device Minor:   1                                                                                                                                                                                      
Model:          NVIDIA A40                                                                                                                                                                             
Brand:          Unknown                                                                                                                                                                                
GPU UUID:       GPU-d49522ac-ffe6-9b4d-12bb-f93ae70c12f1                                                                                                                                               
Bus Location:   00000000:04:00.0                                                                                                                                                                       
Architecture:   8.6                                                                                                                                                                                    
I0414 16:56:00.355335 171673 nvc.c:423] shutting down library context                                                                                                                                  
I0414 16:56:00.357199 171675 driver.c:163] terminating driver service                                                                                                                                  
I0414 16:56:00.357906 171673 driver.c:203] driver service terminated successfully

docker info

Client:
 Context:    default
 Debug Mode: false
 Plugins:
  app: Docker App (Docker Inc., v0.9.1-beta3)
  buildx: Docker Buildx (Docker Inc., v0.8.1-docker)
  scan: Docker Scan (Docker Inc., v0.17.0)

Server:
 Containers: 1
  Running: 0
  Paused: 0
  Stopped: 1
 Images: 69
 Server Version: 20.10.14
 Storage Driver: overlay2
  Backing Filesystem: extfs
  Supports d_type: true
  Native Overlay Diff: true
  userxattr: false
 Logging Driver: json-file
 Cgroup Driver: cgroupfs
 Cgroup Version: 1
 Plugins:
  Volume: local
  Network: bridge host ipvlan macvlan null overlay
  Log: awslogs fluentd gcplogs gelf journald json-file local logentries splunk syslog
 Swarm: inactive
 Runtimes: io.containerd.runc.v2 io.containerd.runtime.v1.linux runc
 Default Runtime: runc
 Init Binary: docker-init
 containerd version: 3df54a852345ae127d1fa3092b95168e4a88e2f8
 runc version: v1.0.3-0-gf46b6ba
 init version: de40ad0
 Security Options:
  apparmor
seccomp                                                                                                                                                                                                 
Profile: default                                                                                                                                                                                     

Kernel Version: 5.4.0-107-generic                                                                                                                                                                      
Operating System: Ubuntu 20.04.4 LTS                                                                                                                                                                   
OSType: linux                                                                                                                                                                                          
Architecture: x86_64                                                                                                                                                                                   
CPUs: 128                                                                                                                                                                                              

Total Memory: 503.7GiB                                                                                                                                                                                 
Name: hq-prdgsv01                                                                                                                                                                                      
ID: Y6TB:QCVX:SWVO:ZAIK:N473:DCDE:HCKZ:QEPW:YZMP:3OEH:56AO:OP2O                                                                                                                                        
Docker Root Dir: /var/lib/docker                                                                                                                                                                       
Debug Mode: false                                                                                                                                                                                      
Registry: https://index.docker.io/v1/                                                                                                                                                                  
Labels:                                                                                                                                                                                                
Experimental: false                                                                                                                                                                                    
Insecure Registries:                                                                                                                                                                                    
127.0.0.0/8                                                                                                                                                                                           

Live Restore Enabled: false                                                                                                                                                                                                                                                                                                                                                                                  
WARNING: No swap limit support
dpkg-query: no packages found matching *nvidia*rpm                                                                                                                                            
dpkg-query: no packages found matching -qa
Desired=Unknown/Install/Remove/Purge/Hold                                                                                                                                                              
| Status=Not/Inst/Conf-files/Unpacked/halF-conf/Half-inst/trig-aWait/Trig-pend                                                                                                                         
|/ Err?=(none)/Reinst-required (Status,Err: uppercase=bad)                                                                                                                                             
||/ Name                          Version                     Architecture Description                                                                                                                 
+++-=============================-===========================-============-========================================================================
un  libgldispatch0-nvidia         <none>                      <none>       (no description available)                        
ii  libnvidia-cfg1-510:amd64      510.60.02-0lambda0~20.04.1  amd64        NVIDIA binary OpenGL/GLX configuration library                                                        
un  libnvidia-cfg1-any            <none>                      <none>       (no description available)                                                                                                  
un  libnvidia-common              <none>                      <none>       (no description available)                                                                                                  
ii  libnvidia-common-510          510.60.02-0lambda0~20.04.1  all          Shared files used by the NVIDIA libraries                                                                                   
ii  libnvidia-compute-418:amd64   430.50-0ubuntu3             amd64        Transitional package for libnvidia-compute-430                                                                              
ii  libnvidia-compute-430:amd64   440.100-0ubuntu0.20.04.1    amd64        Transitional package for libnvidia-compute-440                                                                              
ii  libnvidia-compute-440:amd64   450.66-0lambda0~20.04.2     amd64        Transitional package for libnvidia-compute-450                                                                              
ii  libnvidia-compute-450:amd64   455.45.01-0lambda0~20.04.1  amd64        Transitional package for libnvidia-compute-455                                                                              
ii  libnvidia-compute-455:amd64   460.84-0lambda0~20.04.1     amd64        Transitional package for libnvidia-compute-460                                                                              
ii  libnvidia-compute-460:amd64   470.103.01-0lambda0~20.04.2 amd64        Transitional package for libnvidia-compute-470                                                                              
ii  libnvidia-compute-470:amd64   510.54-0lambda0~20.04.1     amd64        Transitional package for libnvidia-compute-510                                                                              
ii  libnvidia-compute-510:amd64   510.60.02-0lambda0~20.04.1  amd64        NVIDIA libcompute package                                                                                                   
ii  libnvidia-compute-510:i386    510.60.02-0lambda0~20.04.1  i386         NVIDIA libcompute package 
ii  libnvidia-container-tools     1.7.0+ds-0lambda1           amd64        Package for configuring containers with NVIDIA hardware (CLI tool)
ii  libnvidia-container1:amd64    1.7.0+ds-0lambda1           amd64        Package for configuring containers with NVIDIA hardware (shared library)                                                    
un  libnvidia-decode              <none>                      <none>       (no description available)
ii  libnvidia-decode-510:amd64    510.60.02-0lambda0~20.04.1  amd64        NVIDIA Video Decoding runtime libraries                                                                                     
ii  libnvidia-decode-510:i386     510.60.02-0lambda0~20.04.1  i386         NVIDIA Video Decoding runtime libraries                                                                                     
un  libnvidia-encode              <none>                      <none>       (no description available)                                                                                                  
ii  libnvidia-encode-510:amd64    510.60.02-0lambda0~20.04.1  amd64        NVENC Video Encoding runtime library                                                                                        
ii  libnvidia-encode-510:i386     510.60.02-0lambda0~20.04.1  i386         NVENC Video Encoding runtime library                                                                                        
un  libnvidia-extra               <none>                      <none>       (no description available)                                                                                                  
ii  libnvidia-extra-510:amd64     510.60.02-0lambda0~20.04.1  amd64        Extra libraries for the NVIDIA driver                                                                                       
ii  libnvidia-extra-510:i386      510.60.02-0lambda0~20.04.1  i386         Extra libraries for the NVIDIA driver                                                                                       
un  libnvidia-fbc1                <none>                      <none>       (no description available)                                                                                                  
ii  libnvidia-fbc1-510:amd64      510.60.02-0lambda0~20.04.1  amd64        NVIDIA OpenGL-based Framebuffer Capture runtime library
ii  libnvidia-fbc1-510:i386       510.60.02-0lambda0~20.04.1  i386         NVIDIA OpenGL-based Framebuffer Capture runtime library
un  libnvidia-gl                  <none>                      <none>       (no description available)
ii  libnvidia-gl-510:amd64        510.60.02-0lambda0~20.04.1  amd64        NVIDIA OpenGL/GLX/EGL/GLES GLVND libraries and Vulkan ICD
ii  libnvidia-gl-510:i386         510.60.02-0lambda0~20.04.1  i386         NVIDIA OpenGL/GLX/EGL/GLES GLVND libraries and Vulkan ICD
ii  libnvidia-ml-dev              11.1.1-0lambda2             amd64        NVIDIA Management Library (NVML) development package
un  libnvidia-ml1                 <none>                      <none>       (no description available)                                                                                                  
un  nvidia-compute-utils          <none>                      <none>       (no description available)                                                                                                  
ii  nvidia-compute-utils-510      510.60.02-0lambda0~20.04.1  amd64        NVIDIA compute utilities                                                                                                    
un  nvidia-container-runtime-hook <none>                      <none>       (no description available)                                                                                                  
ii  nvidia-container-toolkit      1.7.0+dfsg-0lambda0.20.04.1 amd64        OCI hook for configuring containers for NVIDIA hardware
ii  nvidia-cuda-dev:amd64         11.1.1-0lambda2             amd64        CUDA development files                                                                                                      
un  nvidia-cuda-doc               <none>                      <none>       (no description available)
un  nvidia-cuda-gdb               <none>                      <none>       (no description available)
ii  nvidia-cuda-toolkit           11.1.1-0lambda2             amd64        CUDA development toolkit                                                                                                    
ii  nvidia-dkms-510               510.60.02-0lambda0~20.04.1  amd64        NVIDIA DKMS package                                                                                                         
un  nvidia-dkms-kernel            <none>                      <none>       (no description available)                                                                                                  
un  nvidia-driver-455             <none>                      <none>       (no description available)                                                                                                  
ii  nvidia-driver-510             510.60.02-0lambda0~20.04.1  amd64        NVIDIA driver metapackage                                                                                                   
un  nvidia-driver-binary          <none>                      <none>       (no description available)                                                                                                  
un  nvidia-driver-meta            <none>                      <none>       (no description available)                                                                                                  
ii  nvidia-headless-460           470.103.01-0lambda0~20.04.2 amd64        Transitional package for nvidia-headless-470                                                                                
ii  nvidia-headless-470           510.54-0lambda0~20.04.1     amd64        Transitional package for nvidia-headless-510                                                                                
ii  nvidia-headless-510           510.60.02-0lambda0~20.04.1  amd64        NVIDIA headless metapackage                                                                                                 
un  nvidia-headless-meta          <none>                      <none>       (no description available)                                                                                                  
ii  nvidia-headless-no-dkms-510   510.60.02-0lambda0~20.04.1  amd64        NVIDIA headless metapackage - no DKMS                                                                                       
un  nvidia-headless-no-dkms-meta  <none>                      <none>       (no description available)                                                                                                  
un  nvidia-kernel-common          <none>                      <none>       (no description available)                                                                                                  
ii  nvidia-kernel-common-510      510.60.02-0lambda0~20.04.1  amd64        Shared files used with the kernel module                                                                                    
un  nvidia-kernel-source          <none>                      <none>       (no description available)                                                                                                  
ii  nvidia-kernel-source-510      510.60.02-0lambda0~20.04.1  amd64        NVIDIA kernel source package                                                                                                
un  nvidia-libopencl1-dev         <none>                      <none>       (no description available)                                                                                                  
ii  nvidia-modprobe               510.60.02-0lambda1          amd64        utility to load NVIDIA kernel modules and create device nodes
un  nvidia-opencl-icd             <none>                      <none>       (no description available)                                                                                                  
un  nvidia-persistenced           <none>                      <none>       (no description available)
un  nvidia-prime                  <none>                      <none>       (no description available)
ii  nvidia-profiler               11.1.1-0lambda2             amd64        NVIDIA CUDA profiler                                                                                                        
ii  nvidia-settings               510.60.02-0lambda1          amd64        Tool for configuring the NVIDIA graphics driver                                                                             
un  nvidia-settings-binary        <none>                      <none>       (no description available)                                                                                                  
un  nvidia-smi                    <none>                      <none>       (no description available)                                                                                                  
un  nvidia-utils                  <none>                      <none>       (no description available)                                                                                                  
ii  nvidia-utils-510              510.60.02-0lambda0~20.04.1  amd64        NVIDIA driver support binaries                                                                                              
ii  xserver-xorg-video-nvidia-510 510.60.02-0lambda0~20.04.1  amd64        NVIDIA binary Xorg driver 
cli-version: 1.7.0                                                                                                                                                                                     
lib-version: 1.7.0                                                                                                                                                                                     
build date: 2022-01-26T00:20+00:00                                                                                                                                                                     
build revision:                                                                                                                                                                                        
build compiler: gcc-5 5.4.0 20160609                                                                                                                                                                   
build platform: x86_64                                                                                                                                                                                 
build flags: -D_GNU_SOURCE -D_FORTIFY_SOURCE=2 -Wdate-time -D_FORTIFY_SOURCE=2 -DNDEBUG -std=gnu11 -O2 -g -fdata-sections -ffunction-sections -fstack-protector -fno-strict-aliasing -fvisibility=hidden -Wall -Wextra -Wcast-align -Wpointer-arith -Wmissing-prototypes -Wnonnull -Wwrite-strings -Wlogical-op -Wformat=2 -Wmissing-format-attribute -Winit-self -Wshadow -Wstrict-prototypes -Wunreachable-code -Wconversion -Wsign-conversion -Wno-unknown-warning-option -Wno-format-extra-args -Wno-gnu-alignof-expression -g -O2 -fstack-protector-strong -Wformat -Werror=format-security -Wl,-zrelro -Wl,-znow -Wl,-zdefs -Wl,--gc-sections -Wl,-Bsymbolic-functions -Wl,-z,relro
elezar commented 2 years ago

Similar issue to https://github.com/NVIDIA/nvidia-docker/issues/1626 but the solution suggested there does not apply in my case.

You also seem to have an nvidia-docker.list file in your /etc/apt/sources.list.d directory. What are the contents of this file?

Since your warnings also mention the lambda-repository.list file, what are the contents of this file?

The repository with a signed-by setting must be specified uniquely.
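
A minimal sketch of how one might check for this, assuming the sources live in the default /etc/apt/sources.list.d directory (entries in /etc/apt/sources.list itself are not covered). The function name list_nvidia_sources is hypothetical and not part of any NVIDIA tooling. It only reads files and prints each active entry that references nvidia.github.io, marked by whether it carries a signed-by option:

```shell
# Hypothetical helper: list active APT entries that reference the NVIDIA
# repositories and mark whether each one sets signed-by.
# The directory argument is optional; the default path is an assumption.
list_nvidia_sources() {
    dir="${1:-/etc/apt/sources.list.d}"
    for f in "$dir"/*.list; do
        # Skip the unexpanded glob when no .list files exist.
        [ -f "$f" ] || continue
        while IFS= read -r entry || [ -n "$entry" ]; do
            case "$entry" in
                \#*) continue ;;             # commented-out entry
                *nvidia.github.io*) ;;       # keep NVIDIA entries
                *) continue ;;               # unrelated repository
            esac
            case "$entry" in
                *signed-by=*) printf 'signed-by     %s: %s\n' "$f" "$entry" ;;
                *)            printf 'no-signed-by  %s: %s\n' "$f" "$entry" ;;
            esac
        done < "$f"
    done
}
```

Calling list_nvidia_sources with no argument scans the default directory. If the same repository URL appears both with and without signed-by, for example in nvidia-container-toolkit.list and in nvidia-docker.list, that would match the conflict apt reports here, and keeping only one of the entries should clear it.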

elezar commented 2 years ago

@michaelklachko were you able to resolve your issue?

Note, we also added documentation on how this could be resolved. Please have a look and update this issue accordingly.

Ashish-25-11 commented 2 years ago

I am getting this error:

Conflicting values set for option Signed-By regarding source https://nvidia.github.io/libnvidia-container/stable/ubuntu18.04/amd64/ /: /usr/share/keyrings/nvidia-container-toolkit-keyring.gpg !=
E: The list of sources could not be read.

elezar commented 2 years ago

@Ashish-25-11 please read the section in our documentation on this.

gSingh-maker commented 1 year ago

I am trying to build colmap using Docker, as described here: https://github.com/colmap/colmap/tree/dev/docker. It's supposed to be an easy process, and I have already done it on another machine. It gives the same error as mentioned in this issue. I followed the procedure to remove multiple references. Here is the snippet:

visos@MS-7B45-59:/etc/apt/sources.list.d$ grep -l "nvidia.github.io" *
nvidia-container-toolkit.list

When I run the script to set up the nvidia-toolkit inside Docker, it reports the same error again:

visos@MS-7B45-59:/opt/hdd_1TB/colmap/docker$ ./setup-ubuntu.sh
[sudo] password for visos:
OK
deb https://nvidia.github.io/libnvidia-container/stable/ubuntu18.04/$(ARCH) /
#deb https://nvidia.github.io/libnvidia-container/experimental/ubuntu18.04/$(ARCH) /
deb https://nvidia.github.io/nvidia-container-runtime/stable/ubuntu18.04/$(ARCH) /
#deb https://nvidia.github.io/nvidia-container-runtime/experimental/ubuntu18.04/$(ARCH) /
deb https://nvidia.github.io/nvidia-docker/ubuntu18.04/$(ARCH) /
E: Conflicting values set for option Signed-By regarding source https://nvidia.github.io/libnvidia-container/stable/ubuntu18.04/amd64/ /: /usr/share/keyrings/nvidia-container-toolkit-keyring.gpg !=
E: The list of sources could not be read.
docker: Error response from daemon: could not select device driver "" with capabilities: [[gpu]].
ERRO[0002] error waiting for container:

Does this line in the error output hint at anything?

docker: Error response from daemon: could not select device driver "" with capabilities: [[gpu]].

Inside the /etc/apt/sources.list.d/ directory, nvidia-docker.list gets created again:

visos@MS-7B45-59:/etc/apt/sources.list.d$ grep -l "nvidia.github.io" *
nvidia-container-toolkit.list
nvidia-docker.list

elezar commented 1 year ago

This seems to be caused by https://github.com/colmap/colmap/blob/d812bcbf5a9a13c52292b711a5c1d4a55dfe5812/docker/setup-ubuntu.sh#L4

Please create a PR against that repo to update the files to match the new instructions.

elezar commented 10 months ago

@gSingh-maker were you able to get this working?