ibm-mas / ansible-devops

Ansible collection supporting devops for IBM Maximo Application Suite
https://ibm-mas.github.io/ansible-devops/
Eclipse Public License 2.0
49 stars 82 forks source link

Can't pull predefined version of NVIDIA GPU driver (nvidia_gpu role) #1384

Open SalaryTheft opened 1 month ago

SalaryTheft commented 1 month ago

https://github.com/ibm-mas/ansible-devops/blob/a4c2c3b04e4967bdc5da62b00bec2b0923aa7b33/ibm/mas_devops/roles/nvidia_gpu/defaults/main.yml#L10

스크린샷 2024-07-12 163442

I updated gpu-cluster-policy (ClusterPolicy kind) to use latest driver and it helped.

SalaryTheft commented 1 month ago

Just noticed RHOCS 4.14 is based on RHEL 9.2 https://access.redhat.com/articles/6907891


edit) For a workaround, use the GPU_DRIVER_VERSION environment variable to manually define the driver version.

export GPU_DRIVER_VERSION=550.54.14
JonahLuckett commented 1 month ago

Looking into this now for you