Closed ksebby closed 11 months ago
Also having this issue, error log is identical to your comment.
Try sudo dnf install kernel-modules-extra
and try again. We moved some kernel drivers that aren't normally needed on EC2 into that optional package (in part to save space on AMIs, in part because this was the opportunity for adding things we don't want in the base setup, such as the EFI framebuffer, which are increasing boot/launch time, but are wanted by some customers). We didn't realize that newer nvidia drivers depended in DRM/gem
I had the same exact issue, and what @ozbenh suggested solved the issue.
Thanks. sudo dnf install kernel-modules-extra
did it for me.
This worked for me as well.
This worked for me as well! Thanks Alot!
Try
sudo dnf install kernel-modules-extra
and try again. We moved some kernel drivers that aren't normally needed on EC2 into that optional package (in part to save space on AMIs, in part because this was the opportunity for adding things we don't want in the base setup, such as the EFI framebuffer, which are increasing boot/launch time, but are wanted by some customers). We didn't realize that newer nvidia drivers depended in DRM/gem
How is this so buried? (and praise Google for magically pulling this thread up) Just setting up a new g5g instance and was getting the dreaded "Unable to load the kernel module 'nvidia.ko'" during driver installation. Without that command I would have probably given up, as nothing else was working.
ozbenh suggestion worked for me as well. Thanks for sharing!
sudo dnf install kernel-modules-extra
also helped me with Amazon Linux 2023 on g4dn.xlarge. Can the docs at https://docs.aws.amazon.com/en_us/AWSEC2/latest/UserGuide/install-nvidia-driver.html be updated?
sudo dnf install kernel-modules-extra
also helped me installing NVIDIA-SMI 535.129.03
on Amazon Linux 2023 on g5.xlarge.
I also hope that https://docs.aws.amazon.com/en_us/AWSEC2/latest/UserGuide/install-nvidia-driver.html could be updated, so that newcomers won't have to spend time searching for solutions.
sudo dnf install kernel-modules-extra
fixed the issue for me as well. Great piece of info that could probably be more prominently conveyed to users.
Describe the bug I used to have no problem installing nvidia drivers on G4dn instances with Amazon Linux 2023 (ami-0df435f331839b2d6, al2023-ami-2023.2.20231016.0-kernel-6.1-x86_64) but with the newer ami (ami-01bc990364452ab3e, al2023-ami-2023.2.20231026.0-kernel-6.1-x86_64) I cannot get the NVIDIA drivers to install.
To Reproduce These are the commands I use to install the NVIDIA drivers which work with the 20231016.0 kernel but not the 20231026.0 kernel. As root:
Expected behavior Expect the
./NVIDIA-Linux-x86_64-$DRIVER_VERSION.run
to run successfully but it fails with the errorThe logs do contain the same information.
Additional information I have ensured that there are no other GPU drivers installed and the NVIDIA device is recognized by the system.