Closed FLM210 closed 6 months ago
I have installed gpu-operator in my cluster, and it appears that all components are running normally, but the nvidia.com/cuda.xxx is missing on a certain node。
I have found the reason, as gpu-feature-discovery run before the driver installation was completed, resulting in the inability to load NVML library
I have installed gpu-operator in my cluster, and it appears that all components are running normally, but the nvidia.com/cuda.xxx is missing on a certain node。