Open Mohamed-ben-khemis opened 5 months ago
@Mohamed-ben-khemis Can you run "kubectl get pods -n gpu-operator" to confirm that the driver is running from the operator? We don't install OpenGL libraries from the driver container today. @elezar do you see any issues with the container-toolkit injecting the necessary config files in this case?
@shivamerla Here are the results from running kubectl get pods -n gpu-operator:
Troubleshooting VirtualGL with NVIDIA GPU Operator in EKS
Issue Summary
I am encountering issues with VirtualGL failing to detect GPUs within my EKS (Amazon Elastic Kubernetes Service) cluster using the NVIDIA GPU Operator. Despite confirming GPU presence with nvidia-smi, running glxgears with GPU acceleration using vglrun results in the following error:
Details
vglrun -d /dev/nvidia0 glxgears
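One thing that may be worth checking is the value passed to -d. The following is a minimal sketch, not a confirmed diagnosis: it assumes (from my reading of the VirtualGL 3.x documentation) that the EGL back end expects either "egl", an indexed form such as "egl0", or a DRI device path such as /dev/dri/card0, rather than an NVIDIA character device. The helper name vgl_device_kind is hypothetical and only classifies the string; it does not talk to VirtualGL.

```shell
#!/bin/sh
# Hypothetical helper: classify the argument that would be given to `vglrun -d`.
# Assumption (not confirmed in this thread): VirtualGL's EGL back end accepts
# "egl", "eglN", or a DRI device path; anything else is reported as unrecognized.
vgl_device_kind() {
  case "$1" in
    egl|egl[0-9]*)        echo "egl-index" ;;     # e.g. egl, egl0
    /dev/dri/card[0-9]*)  echo "dri-path" ;;      # e.g. /dev/dri/card0
    :[0-9]*)              echo "x-display" ;;     # e.g. :0 or :0.0 (GLX back end)
    *)                    echo "unrecognized" ;;  # e.g. /dev/nvidia0
  esac
}

# Classify the device used in the failing command.
vgl_device_kind /dev/nvidia0

# Show which DRI nodes, if any, are visible inside the container.
ls -l /dev/dri 2>/dev/null || echo "no /dev/dri in this container"
```

If the first line prints "unrecognized" and /dev/dri is missing, the container may simply not have a DRI node exposed, which would be consistent with (but does not prove) the "Invalid EGL device" error.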
Issue
VirtualGL (vglrun) fails to initialize the 3D environment (glxgears) with an "Invalid EGL device" error when attempting GPU acceleration.
Questions
Additional Information
nvidia-smi within the container confirms GPU presence and functionality.
Terraform installation: