Closed tmhm closed 5 years ago
I found this issue before to be to do with the openGL libraries and the way they were installed. If you try doing: xvfb-run glxinfo > test.out
Do you see any errors? I had this error:
libGL error: unable to load driver: swrast_dri.so
libGL error: failed to load driver: swrast
I'm afraid i wasn;t able to resolve it on the affected machine and had to use a different one but perhaps this will help in your diagnosis of the problem. I think it's particularly a problem on multi-GPU machines from what i read.
Thanks, @mattthelee , I'm not very clear about the problem now, and, I had other error:
> xvfb-run glxinfo
name of display: :99
Error: couldn't find RGB GLX visual or fbconfig
sudo apt-get remove --purge xserver-xorg
sudo apt-get install xserver-xorg
sudo dpkg-reconfigure xserver-xorg
sudo reboot
And, the error is still exist,
~> glxinfo | grep render
direct rendering: No (If you want to find out why, try setting LIBGL_DEBUG=verbose)
OpenGL renderer string: Software Rasterizer
~> lspci | grep VGA
09:00.0 VGA compatible controller: Device 19e5:1711 (rev 01)
~> lspci -k | grep -iA3 vga
09:00.0 VGA compatible controller: Device 19e5:1711 (rev 01)
7f:08.0 System peripheral: Intel Corporation Xeon E7 v3/Xeon E5 v3/Core i7 QPI Link 0 (rev 02)
Subsystem: Device 19e5:2063
7f:08.2 Performance counters: Intel Corporation Xeon E7 v3/Xeon E5 v3/Core i7 QPI Link 0 (rev 02)
// vi /var/log/Xorg.0.log
1 [ 12.205]
2 X.Org X Server 1.18.4
3 Release Date: 2016-07-19
4 [ 12.205] X Protocol Version 11, Revision 0
5 [ 12.205] Build Operating System: Linux 4.4.0-97-generic x86_64 Ubuntu
6 [ 12.205] Current Operating System: Linux jack-XH622-V3 4.4.0-139-generic #165-Ubuntu SMP Wed Oct 24 10:58:50 UTC 2018 x86_64
7 [ 12.205] Kernel command line: BOOT_IMAGE=/boot/vmlinuz-4.4.0-139-generic root=UUID=fa475b96-05a7-4f39-a6b7-3bd7be92be1f ro quiet splash vt.handoff=7
8 [ 12.205] Build Date: 13 October 2017 01:57:05PM
9 [ 12.205] xorg-server 2:1.18.4-0ubuntu0.7 (For technical support please see http://www.ubuntu.com/support)
10 [ 12.205] Current version of pixman: 0.33.6
11 [ 12.205] Before reporting problems, check http://wiki.x.org
12 to make sure that you have the latest version.
13 [ 12.205] Markers: (--) probed, (**) from config file, (==) default setting,
14 (++) from command line, (!!) notice, (II) informational,
15 (WW) warning, (EE) error, (NI) not implemented, (??) unknown.
........
132 [ 12.914] (EE) [drm] Failed to open DRM device for (null): -22
133 [ 12.914] (EE) [drm] Failed to open DRM device for (null): -22
134 [ 12.914] (EE) [drm] Failed to open DRM device for (null): -22
135 [ 12.914] (EE) [drm] Failed to open DRM device for (null): -22
136 [ 12.914] (WW) Falling back to old probe method for modesetting
137 [ 12.914] (II) Loading sub module "fbdevhw"
138 [ 12.914] (II) LoadModule: "fbdevhw"
139 [ 12.914] (II) Loading /usr/lib/xorg/modules/libfbdevhw.so
140 [ 12.959] (II) Module fbdevhw: vendor="X.Org Foundation"
141 [ 12.959] compiled for 1.18.4, module version = 0.0.2
142 [ 12.959] ABI class: X.Org Video Driver, version 20.0
143 [ 12.959] (**) FBDEV(1): claimed PCI slot 9@0:0:0
144 [ 12.959] (II) FBDEV(1): using default device
145 [ 12.959] (WW) Falling back to old probe method for vesa
146 [ 12.959] (EE) Screen 0 deleted because of no matching config section.
......
177 [ 12.990] (II) FBDEV(0): FBIOBLANK: Invalid argument (Screen blanking not supported by kernel - disabling)
178 [ 13.013] (==) FBDEV(0): Backing store enabled
179 [ 13.024] (==) FBDEV(0): DPMS enabled
180 [ 13.024] (==) RandR enabled
181 [ 13.028] (II) SELinux: Disabled on system
182 [ 13.036] (EE) Failed to initialize GLX extension (Compatible NVIDIA X driver not found)
I'm run the command
xvfb-run -s "-ac -screen 0 1280x1024x24" python first_agent.py
on my GPU server. with thefirst_agent.py
code as minerl, GotNo OpenGL context error
, as follows,GPU #0: Name : Tesla K80 UUID : GPU-1646c64d-87d3-402a-4386-daa79b530882 PCI BusID : PCI:6:0:0
Number of Display Devices: 0
GPU #1: Name : Tesla K80 UUID : GPU-f5268c2d-3265-ea48-b8c4-ca6a32f65551 PCI BusID : PCI:7:0:0
Number of Display Devices: 0
GPU #2: Name : Tesla K80 UUID : GPU-6c44bda1-2dbd-94ea-455c-115427411c2f PCI BusID : PCI:132:0:0
Number of Display Devices: 0
GPU #3: Name : Tesla K80 UUID : GPU-0518dd7a-f291-8c42-a317-2d641749fcbc PCI BusID : PCI:133:0:0
Number of Display Devices: 0
btw, All my operations use ssh through mobaxterm. Thanks :)
And the whole log as ,