Closed: rnd-ash closed this issue 2 years ago
Hi, I couldn't repro this with 5.18.0-arch1-1 from testing. Can you try with nvidia-open-dkms-515.43.04-8 just to be sure that the kernel module was rebuilt with the matching kernel headers?
I was using the nvidia-open-dkms package, had run mkinitcpio multiple times, and DKMS said it was installing the open modules for 5.18, so I assume I had matching headers.
Downgraded to kernel 5.17.9-arch1-1 and everything works for me.
@rnd-ash: Could you experiment to see if the same ENDBR error happens with:
(1) the open kernel modules packaged with the NVIDIA .run file (i.e., install from the .run file with -m=kernel-open)
(2) the closed kernel modules packaged with the NVIDIA .run file (i.e., install from the .run file with -m=kernel)
I'm curious if the problem has something to do with how the open nvidia.ko was built by arch-linux (maybe something about the toolchain used). I think experiments (1) and (2) should help shake that out.
It looks like ENDBR is new in 5.18. I wonder if the problem here only manifests with certain kernel kconfigs. E.g., maybe it requires X86_KERNEL_IBT.
From archlinux's config file, I can see that X86_KERNEL_IBT is enabled on the problematic kernel version.
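For anyone who wants to check this on their own machine, here is a minimal sketch. It assumes the running kernel exposes its config at /proc/config.gz (other setups may ship it elsewhere), and the function name ibt_config_state is made up for illustration.

```sh
# ibt_config_state: reads kernel config text on stdin and reports
# whether CONFIG_X86_KERNEL_IBT is built in.
# (hypothetical helper name, not part of any NVIDIA or Arch tooling)
ibt_config_state() {
    if grep -q '^CONFIG_X86_KERNEL_IBT=y$'; then
        echo enabled
    else
        echo disabled
    fi
}

# Only try the live kernel config if it is actually exposed
if [ -r /proc/config.gz ]; then
    zcat /proc/config.gz | ibt_config_state
fi
```

If this prints "disabled", the Missing ENDBR trap discussed here is probably not what you are hitting.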
I tried to download the .run file from https://us.download.nvidia.com/XFree86/Linux-x86_64/515.43.04/NVIDIA-Linux-x86_64-515.43.04.run, but every time I tried to run it I kept getting "installation failed".
However, I switched over to try both the nvidia-open-dkms and nvidia-dkms packages from arch (PKGBUILDs can be seen here and here), and they all result in the same ENDBR error.
Does it work if you pass the kernel 'ibt=off'?
Just tried it, it does!
ibt=off also works for me from grub. I am running ArcoLinux on a Dell XPS 9520 (NVIDIA 3050 and 16 GB RAM). I am also running NVIDIA Prime. Any idea what the issue may be?
GPU: NVIDIA RTX 3060 laptop. Driver version: closed NVIDIA driver 515.43.04. I also faced this problem. After updating the kernel to Linux 5.18, boot displays "Failed start to Linux Kernel".
Does it work if you pass the kernel 'ibt=off'?
Perfect fix😉
Same issue here. Kernel: 5.18 - GPU Driver: NVIDIA 515.43.04 - Rtx 3080 desktop
@atiensivu thank you, what does ibt=off do? It boots if I use it
I also faced this problem. Kernel: 5.18.0-arch1-1 GPU Driver: NVIDIA 515.43.04 - RTX 3070 (laptop)
Optimus-manager failed.
Adding ibt=off to the bootloader (grub for me) fixed it.
what does ibt=off do?
Add support for Intel CET-IBT (Indirect Branch Tracking), a hardware-supported coarse-grain forward-edge Control Flow Integrity protection. It enforces that all indirect calls must land on an ENDBR instruction; as such, the compiler will instrument the code with them to make this happen.
Does anyone know how to add this parameter when using EFISTUB?
Edit: Downgraded my kernel for now. I found this issue after my boot was hanging on "start job is running for Load Kernel Modules". To downgrade:
cd /var/cache/pacman/pkg
pacman -U linux-5.17.5.arch1-1-x86_64.pkg.tar.zst
same goes for me
Kernel: 5.18.0-arch1-1 GPU Driver: NVIDIA 515.43.04 - RTX 2080
Happens to me too Kernel: 5.18.0-arch1-1 Driver: NVIDIA 515.43.04 GPU: MX450
Setting ibt=off fixes the issue temporarily, but cannot be considered a full solution.
I can confirm, I have the same issue. Arch Linux 5.18.0-zen1-1-zen Nvidia RTX 5000
How can we fix this issue?
@CryptLabs You can temporarily fix the issue by adding ibt=off to your kernel command line parameters.
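On GRUB that usually means appending the parameter to GRUB_CMDLINE_LINUX_DEFAULT in /etc/default/grub and regenerating grub.cfg. A rough sketch, assuming GNU sed and a double-quoted value on that line; add_kernel_param is a hypothetical helper name, and the demo only touches a scratch file:

```sh
# add_kernel_param FILE PARAM
# Appends PARAM to the GRUB_CMDLINE_LINUX_DEFAULT="..." line of FILE,
# unless it is already there. Assumes the value is double-quoted.
add_kernel_param() {
    file=$1
    param=$2
    if grep -Eq "^GRUB_CMDLINE_LINUX_DEFAULT=.*[\" ]${param}[\" ]" "$file"; then
        return 0    # already present, leave the file alone
    fi
    sed -i "s/^GRUB_CMDLINE_LINUX_DEFAULT=\"\(.*\)\"/GRUB_CMDLINE_LINUX_DEFAULT=\"\1 ${param}\"/" "$file"
}

# Demo on a scratch copy so nothing under /etc is touched
scratch=$(mktemp)
printf 'GRUB_CMDLINE_LINUX_DEFAULT="loglevel=3 quiet"\n' > "$scratch"
add_kernel_param "$scratch" ibt=off
cat "$scratch"
rm -f "$scratch"
```

On a real system you would run the function as root against /etc/default/grub and then regenerate the config with grub-mkconfig -o /boot/grub/grub.cfg before rebooting.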
I can confirm the same issue on my laptop.
Arch Linux 5.18.0-arch1-1 Driver Version: 515.43.04
RTX 3070 Laptop
@mahancoder I have used ibt=off. However, as you said, I feel that this is not a good solution.
please fix
I just installed latest nvidia dkms drivers (515.43.04-2) and the fix does not work anymore
5.18.0-arch1-1 nvidia-dkms 515.43.04-2 Nvidia GTX1080
Same problem here: the latest Nvidia driver literally broke my system. I was stuck on Reached target Graphical Interface and received several errors. The only way to interact with the system was CTRL+ALT+F2.
What fixed the issue:
Either uninstall all Nvidia packages or pass the ibt=off flag in the kernel parameters.
I just installed latest nvidia drivers (515.43.04-6) and the fix does not work anymore
Did you try to remove the ibt=off flag when using the latest Nvidia driver?
I had the same issue this morning when updating the drivers. It seems like ibt=off was removed from my kernel options in my latest update; when I re-added it, the drivers started working again.
There appears to be an open bug now on Archlinux about this issue https://bugs.archlinux.org/task/74891
@m1guelperez yes, the first thing I've done is to remove and reboot but still not working.
Also, I'm not sure it's related, but even with the nvidia drivers loading properly, HDMI does not seem to work with the workaround (my HDMI is wired to my GPU).
The following patch will insert the necessary endbr64 instructions:
diff --git a/src/nvidia-modeset/Makefile b/src/nvidia-modeset/Makefile
index c63b86b..69490d0 100644
--- a/src/nvidia-modeset/Makefile
+++ b/src/nvidia-modeset/Makefile
@@ -95,7 +95,6 @@ CFLAGS += -ffunction-sections
CFLAGS += -fdata-sections
CFLAGS += -ffreestanding
-CONDITIONAL_CFLAGS := $(call TEST_CC_ARG, -fcf-protection=none)
CONDITIONAL_CFLAGS += $(call TEST_CC_ARG, -Wformat-overflow=2)
CONDITIONAL_CFLAGS += $(call TEST_CC_ARG, -Wformat-truncation=1)
ifeq ($(TARGET_ARCH),x86_64)
diff --git a/src/nvidia/Makefile b/src/nvidia/Makefile
index 9bdb826..cc05ab7 100644
--- a/src/nvidia/Makefile
+++ b/src/nvidia/Makefile
@@ -119,8 +119,6 @@ CFLAGS += -fdata-sections
NV_KERNEL_O_LDFLAGS += --gc-sections
EXPORTS_LINK_COMMAND = exports_link_command.txt
-CONDITIONAL_CFLAGS += $(call TEST_CC_ARG, -fcf-protection=none)
-
ifeq ($(TARGET_ARCH),x86_64)
CONDITIONAL_CFLAGS += $(call TEST_CC_ARG, -mindirect-branch-register)
CONDITIONAL_CFLAGS += $(call TEST_CC_ARG, -mindirect-branch=thunk-extern)
Is there anyone facing these problems that can try rebuilding the modules with the patch and report back?
I'm not sure why the -fcf-protection=none is there in the first place, but I expect it was an attempt to minimize the code size.
@m1guelperez yes, the first thing I've done is to remove and reboot but still not working.
Also, I'm not sure it's related, but even with the nvidia drivers loading properly, HDMI does not seem to work with the workaround (my HDMI is wired to my GPU).
Hmm, I can't help you there since I use DP. But I will definitely wait with any updates for now. 😄
I can confirm the issue.
i5-11400h + RTX 3050ti laptop nvidia-dkms 515.43.04-2 + 5.18.zen1-1
ibt=off fixes the issue
I tried to rebuild and install the modules with this patch; the problem still exists.
ibt=off still allows you to boot normally.
May 30 20:52:25 laptop kernel: ---[ end trace 0000000000000000 ]---
May 30 20:52:25 laptop kernel: Modules linked in: nvidia(OE+) i915 intel_gtt drm_buddy video drm_dp_helper ttm btrfs blake2b_generic libcrc32c crc32c_generic crc32c_intel xor raid6_pq
May 30 20:52:25 laptop kernel: </TASK>
May 30 20:52:25 laptop kernel: R13: 000055a5562dab40 R14: 000055a5562dacb0 R15: 000055a5562dd860
May 30 20:52:25 laptop kernel: R10: 0000000000000003 R11: 0000000000000246 R12: 000055a5562db4d0
May 30 20:52:25 laptop kernel: RBP: 0000000000060000 R08: 0000000000000000 R09: 00007ffd6869c880
May 30 20:52:25 laptop kernel: RDX: 0000000000000000 RSI: 000055a5562db4d0 RDI: 0000000000000003
May 30 20:52:25 laptop kernel: RAX: ffffffffffffffda RBX: 000055a5562dacb0 RCX: 00007fae296df67d
May 30 20:52:25 laptop kernel: RSP: 002b:00007ffd6869c748 EFLAGS: 00000246 ORIG_RAX: 0000000000000139
May 30 20:52:25 laptop kernel: Code: 5d c3 66 2e 0f 1f 84 00 00 00 00 00 90 f3 0f 1e fa 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d eb 26 0f >
May 30 20:52:25 laptop kernel: RIP: 0033:0x7fae296df67d
May 30 20:52:25 laptop kernel: entry_SYSCALL_64_after_hwframe+0x44/0xae
May 30 20:52:25 laptop kernel: ? do_syscall_64+0x6b/0x90
May 30 20:52:25 laptop kernel: ? syscall_exit_to_user_mode+0x26/0x50
May 30 20:52:25 laptop kernel: ? __x64_sys_lseek+0x6d/0xc0
May 30 20:52:25 laptop kernel: do_syscall_64+0x5c/0x90
May 30 20:52:25 laptop kernel: __x64_sys_finit_module+0xc1/0x130
May 30 20:52:25 laptop kernel: do_init_module+0x4a/0x240
May 30 20:52:25 laptop kernel: do_one_initcall+0x118/0x2d0
May 30 20:52:25 laptop kernel: nvidia_frontend_init_module+0x50/0x91 [nvidia 416207d86ba54fc0cbe32354e28a12d664d17d2d]
May 30 20:52:25 laptop kernel: ? nvidia_init_module+0x627/0x627 [nvidia 416207d86ba54fc0cbe32354e28a12d664d17d2d]
May 30 20:52:25 laptop kernel: nvidia_init_module+0x22e/0x627 [nvidia 416207d86ba54fc0cbe32354e28a12d664d17d2d]
May 30 20:52:25 laptop kernel: rm_init_rm+0x9/0x10 [nvidia 416207d86ba54fc0cbe32354e28a12d664d17d2d]
May 30 20:52:25 laptop kernel: RmInitRm+0x9/0x20 [nvidia 416207d86ba54fc0cbe32354e28a12d664d17d2d]
May 30 20:52:25 laptop kernel: coreInitializeRm+0x24/0x90 [nvidia 416207d86ba54fc0cbe32354e28a12d664d17d2d]
May 30 20:52:25 laptop kernel: portInitialize+0x2b/0x40 [nvidia 416207d86ba54fc0cbe32354e28a12d664d17d2d]
May 30 20:52:25 laptop kernel: portCryptoInitialize+0x2a/0x40 [nvidia 416207d86ba54fc0cbe32354e28a12d664d17d2d]
May 30 20:52:25 laptop kernel: portCryptoPseudoRandomGeneratorCreate+0x16/0xb0 [nvidia 416207d86ba54fc0cbe32354e28a12d664d17d2d]
May 30 20:52:25 laptop kernel: ? nvidia_init_module+0x627/0x627 [nvidia 416207d86ba54fc0cbe32354e28a12d664d17d2d]
May 30 20:52:25 laptop kernel: _portMemAllocatorAlloc+0x2e/0x170 [nvidia 416207d86ba54fc0cbe32354e28a12d664d17d2d]
May 30 20:52:25 laptop kernel: ? portCryptoPseudoRandomGeneratorGetU32+0x30/0x30 [nvidia 416207d86ba54fc0cbe32354e28a12d664d17d2d]
May 30 20:52:25 laptop kernel: R13: 0000000000000000 R14: ffffb4d640687ca6 R15: 0000000000000000
May 30 20:52:25 laptop kernel: R10: ffffffffc0d8d610 R11: 0000000000000000 R12: ffffffffc0d8fbb0
May 30 20:52:25 laptop kernel: RBP: ffffb4d640687b70 R08: 0000000000000020 R09: ffffffffc0d8fc30
May 30 20:52:25 laptop kernel: RDX: 0000000000000000 RSI: 000000000000002c RDI: ffffffffc0d8fbb0
May 30 20:52:25 laptop kernel: RAX: ffffffffc0b46280 RBX: 0000000000000010 RCX: 0000000000000000
May 30 20:52:25 laptop kernel: RSP: 0018:ffffb4d640687b58 EFLAGS: 00010202
May 30 20:52:25 laptop kernel: Code: 08 48 89 d0 48 89 0f 48 c1 e0 17 48 31 c2 48 89 c8 48 c1 e8 05 48 31 c8 48 31 d0 48 c1 ea 12 48 31 d0 48 89 47 08 01 c8 c3 90 <48> 89 f7 e9 38 0f 00 00 0f 1f 84 00 00 00 00 >
May 30 20:52:25 laptop kernel: RIP: 0010:_portMemAllocatorAllocNonPagedWrapper+0x0/0x10 [nvidia]
May 30 20:52:25 laptop kernel: asm_exc_control_protection+0x22/0x30
May 30 20:52:25 laptop kernel: <TASK>
May 30 20:52:25 laptop kernel: Call Trace:
May 30 20:52:25 laptop kernel: PKRU: 55555554
May 30 20:52:25 laptop kernel: CR2: 000055a5562dd000 CR3: 00000001032da005 CR4: 0000000000f70ee0
May 30 20:52:25 laptop kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
May 30 20:52:25 laptop kernel: FS: 00007fae295d1740(0000) GS:ffff9e5d60300000(0000) knlGS:0000000000000000
May 30 20:52:25 laptop kernel: R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000
May 30 20:52:25 laptop kernel: R10: ffffffffac25aa20 R11: ffff9e59c116f600 R12: 0000000000000000
May 30 20:52:25 laptop kernel: RBP: 0000000000000003 R08: 0000000000000001 R09: 00000000ffffffea
May 30 20:52:25 laptop kernel: RDX: 0000000000000000 RSI: 00000000ffffefff RDI: 0000000000000003
May 30 20:52:25 laptop kernel: RAX: 000000000000004d RBX: ffffb4d640687aa8 RCX: 0000000000000000
May 30 20:52:25 laptop kernel: RSP: 0018:ffffb4d640687a88 EFLAGS: 00010002
May 30 20:52:25 laptop kernel: Code: 8b 93 80 00 00 00 be f9 00 00 00 48 c7 c7 5e 8c a6 ab e8 71 80 30 ff e9 72 ff ff ff 48 c7 c7 45 8c a6 ab e8 35 2f fa ff 0f 0b <0f> 0b 66 66 2e 0f 1f 84 00 00 00 00 00 90 66 >
May 30 20:52:25 laptop kernel: RIP: 0010:exc_control_protection+0xc2/0xd0
May 30 20:52:25 laptop kernel: Hardware name: Acer Nitro AN515-57/Scala_TLS, BIOS V1.11 09/28/2021
May 30 20:52:25 laptop kernel: CPU: 4 PID: 191 Comm: modprobe Tainted: G W OE 5.18.0-zen1-1-zen #1 8c1b4772d057e8d6ef1ec6c49ac9700bcd2a2e4e
May 30 20:52:25 laptop kernel: invalid opcode: 0000 [#1] PREEMPT SMP NOPTI
May 30 20:52:25 laptop kernel: kernel BUG at arch/x86/kernel/traps.c:252!
May 30 20:52:25 laptop kernel: ------------[ cut here ]------------
May 30 20:52:25 laptop kernel: traps: Missing ENDBR: _portMemAllocatorAllocNonPagedWrapper+0x0/0x10 [nvidia]
May 30 20:52:25 laptop kernel:
May 30 20:52:25 laptop kernel: nvidia-nvlink: Nvlink Core is being initialized, major device number 236
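If you want to pull just the offending symbol out of a log like the one above, something along these lines should work. It is only a sketch: missing_endbr_symbols is a made-up helper name, and the sample line is copied from the trace in this comment.

```sh
# missing_endbr_symbols: reads kernel log lines on stdin and prints each
# distinct symbol reported by a "Missing ENDBR" trap, one per line.
# (hypothetical helper name, for illustration only)
missing_endbr_symbols() {
    sed -n 's/.*Missing ENDBR: \([^+ ]*\).*/\1/p' | sort -u
}

# Demo with the trap line from the log above
missing_endbr_symbols <<'EOF'
May 30 20:52:25 laptop kernel: nvidia-nvlink: Nvlink Core is being initialized, major device number 236
May 30 20:52:25 laptop kernel: traps: Missing ENDBR: _portMemAllocatorAllocNonPagedWrapper+0x0/0x10 [nvidia]
EOF
```

On a systemd machine the same function can be fed from the current boot's kernel log with journalctl -k -b | missing_endbr_symbols.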
@TheBakerCat please try the following patch from https://bugs.archlinux.org/task/74886#comment208651
diff --git a/src/nvidia-modeset/Makefile b/src/nvidia-modeset/Makefile
index c63b86b..1e92bb0 100644
--- a/src/nvidia-modeset/Makefile
+++ b/src/nvidia-modeset/Makefile
@@ -95,7 +95,8 @@ CFLAGS += -ffunction-sections
CFLAGS += -fdata-sections
CFLAGS += -ffreestanding
-CONDITIONAL_CFLAGS := $(call TEST_CC_ARG, -fcf-protection=none)
+CONDITIONAL_CFLAGS := $(call TEST_CC_ARG, -fcf-protection=branch -mindirect-branch-register)
+CONDITIONAL_CFLAGS += $(call TEST_CC_ARG, -mharden-sls=all)
CONDITIONAL_CFLAGS += $(call TEST_CC_ARG, -Wformat-overflow=2)
CONDITIONAL_CFLAGS += $(call TEST_CC_ARG, -Wformat-truncation=1)
ifeq ($(TARGET_ARCH),x86_64)
diff --git a/src/nvidia/Makefile b/src/nvidia/Makefile
index 9bdb826..3f1e330 100644
--- a/src/nvidia/Makefile
+++ b/src/nvidia/Makefile
@@ -119,7 +119,8 @@ CFLAGS += -fdata-sections
NV_KERNEL_O_LDFLAGS += --gc-sections
EXPORTS_LINK_COMMAND = exports_link_command.txt
-CONDITIONAL_CFLAGS += $(call TEST_CC_ARG, -fcf-protection=none)
+CONDITIONAL_CFLAGS += $(call TEST_CC_ARG, -fcf-protection=branch -mindirect-branch-register)
+CONDITIONAL_CFLAGS += $(call TEST_CC_ARG, -mharden-sls=all)
ifeq ($(TARGET_ARCH),x86_64)
CONDITIONAL_CFLAGS += $(call TEST_CC_ARG, -mindirect-branch-register)
Patch works for me without the ibt=off parameter (GTX 1650).
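A quick way to sanity-check that the source tree you are building from really is patched is to look for the old flag in the two Makefiles touched by the diff. This is only a sketch; has_cf_protection_none is a hypothetical helper name, and the paths assume you are in the top of the open-gpu-kernel-modules checkout.

```sh
# has_cf_protection_none FILE
# Succeeds if FILE still passes -fcf-protection=none to the compiler,
# i.e. the Makefile has not been patched yet.
# (hypothetical helper name, for illustration only)
has_cf_protection_none() {
    grep -q -e '-fcf-protection=none' "$1"
}

# Paths taken from the diff above; skipped if not run from the source tree
for mk in src/nvidia/Makefile src/nvidia-modeset/Makefile; do
    [ -f "$mk" ] || continue
    if has_cf_protection_none "$mk"; then
        echo "$mk: still unpatched"
    else
        echo "$mk: patched"
    fi
done
```

If either file is reported as unpatched, the rebuilt module will most likely still trap on a kernel with IBT enabled.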
This is the beauty of open source. The exact cause and fix of the issue was found by the community, and even if NVIDIA for some reason refused to merge it, distros could just include the patch in their packages.
If this was still the proprietary driver only, we would all have to include ibt=off until someone at NVIDIA found the issue and incorporated a fix into the next release.
maybe I'm doing something wrong, but it still doesn't work for me
ibt=off works for me, although my device still blocks on boot while loading nvidia drivers some of the time. Hopefully an update will come out soon.
xps15 9510
ibt=off also works for me, in this relative order to the IOMMU setup for PCI passthrough to VMs:
BOOT_IMAGE=<XXXXX> root=<XXXXX> rw intel_iommu=on vfio-pci.ids=10de:0e1a nvidia-drm.modeset=1 ibt=off loglevel=3
I am not sure the order matters much, but I placed it right after all the passthrough flags.
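To confirm the parameter actually reached the kernel regardless of where it sits on the line, you can check the booted command line. A small sketch, with has_kernel_param as a made-up helper name; it matches whole space-separated words only.

```sh
# has_kernel_param PARAM CMDLINE
# Succeeds if PARAM appears as a whole word anywhere in CMDLINE,
# so the position on the line does not matter.
# (hypothetical helper name, for illustration only)
has_kernel_param() {
    case " $2 " in
        *" $1 "*) return 0 ;;
        *) return 1 ;;
    esac
}

# Check the command line of the currently running kernel, if readable
if [ -r /proc/cmdline ]; then
    if has_kernel_param ibt=off "$(cat /proc/cmdline)"; then
        echo "ibt=off is active"
    else
        echo "ibt=off is not set"
    fi
fi
```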
this is running: 5.18.1 12th Gen Intel 12-129000K GPU 1: NVIDIA GeForce GTX 780 (passed through to vms) GPU 2: NVIDIA GeForce GTX 1080 (running the main system)
The patch worked for me
I'm running: Linux 5.18.1 Intel Core i3-12100F NVIDIA GeForce RTX 3050
Tracking internally as bug 3665573
@atiensivu thank you, what does ibt=off do? It boots if I use it
It turns off IBT at run time for scenarios like this. The ideal fix is to make the driver work with IBT, but in the meantime this is a good workaround to get things working.
I see an update to linux-5.18.1.arch1-1 and nvidia-515.43.04-7. Can anyone confirm if it has been fixed in 5.18.1?
Not tested but it seems so, see here: https://bugs.archlinux.org/task/74886#comment208658
Not working for me on 5.18.1
I have updated to the latest kernel and drivers, but the issue has not been fixed, at least on my machine. I still have to disable ibt when booting the system.
I use an RTX3070 and an i5-12600KF.
❯ pacman -Q nvidia
nvidia 515.43.04-7
❯ pacman -Q linux
linux 5.18.1.arch1-1
Issue persists on 5.18.1 (archlinux), with either the nvidia or nvidia-open drivers installed. Irritatingly, the module "fails to load" even when no Nvidia GPU is connected.
For Arch Linux: I misread the issue. It should be fixed in nvidia-515.43.04-8, which isn't published yet: https://archlinux.org/packages/extra/x86_64/nvidia/ But you can still try it by downloading the package attached to the issue and installing it from there.
Assuming nvidia-open-515.43.04-9 includes the mentioned fix, I just tried it with linux 5.18.1 and the issue is still there for me.
I use Arch with a laptop that has i7-1165G7 and the nvidia driver is for an external gpu which is not currently connected.
Edit: Using the updated nvidia-open driver resolved the issue.
Regarding the ibt=off workaround: Does setting this flag make the kernel less secure than 5.17? Or was IBT only added in 5.18?
For Arch Linux: I misread the issue. It should be fixed in nvidia-515.43.04-8, which isn't published yet: https://archlinux.org/packages/extra/x86_64/nvidia/ But you can still try it by downloading the package attached to the issue and installing it from there.
Are you referring to the Nvidia LTS branch? I think I found the -8 version you were talking about. Also, -7, -8 and -9 just refer to the repositories nvidia, nvidia-lts and nvidia-open: 515.43.04-7 being the "nvidia" repository, 515.43.04-8 being nvidia-lts and 515.43.04-9 being nvidia-open. I noticed this on page 19 of the official repo listing: https://archlinux.org/packages/?page=19&repo=Extra
Edit- Just making sure my thinking is right here since I got a thumbs down but it seems to me that the -7 -8 and -9 is just a naming scheme for the 3 different repositories.
Edit #2: it seems that the -8 (LTS) version is NOT the same as nvidia-open-515.43.04-8 and, correct me if I am wrong, I think -9 is not the updated version of that branch either. I think you have to install the exact nvidia-open-515.43.04-8 from https://bugs.archlinux.org/task/74886#comment208651 to get the fix.
I can confirm that my problem was really solved with the patch from here https://github.com/NVIDIA/open-gpu-kernel-modules/issues/256#issuecomment-1141350315 and upgrade to 5.18.1-zen
latest version of nvidia-dkms still doesn't work without ibt=off
Assuming nvidia-open-515.43.04-9 includes the mentioned fix, I just tried it with linux 5.18.1 and the issue is still there for me.
I use Arch on Lenovo x1-carbon gen-9 and the nvidia driver is for an external gpu which is not currently connected.
I also tried updating to the latest nvidia-dkms (nvidia-dkms 515.48.07-1) and nvidia-utils packages available in the testing repository, but the system still won't boot without ibt=off.
~~I can confirm that my problem was completely solved with the patch from here
https://github.com/NVIDIA/open-gpu-kernel-modules/issues/256#issuecomment-1141350315
Works with 5.18-arch1-1 !!!~~
Make sure you download the file here at the VERY bottom of the page https://bugs.archlinux.org/task/74886#comment208651
Extract it into your home folder.
then in the Terminal
cd nvidia-open
makepkg -g >> PKGBUILD
makepkg -si
I posted the terminal commands because I had no idea how to apply the patch. Just for the normies like me who haven't dabbled in patching packages. EDIT - I forgot the -si; that's what makes it install.
Edit 2 - I think installing the patch this way should work. Maybe the ArchTitus script just messed up my nvidia-open install?
Edit 3 - Is a common outcome with this issue your Nvidia card not recognizing nvidia-open as a compatible driver at all? Or is it just the ArchTitus script causing that? This Picture is the result after I installed the custom patch
the -x numbering is an Arch specific package version bump, see https://wiki.archlinux.org/title/Arch_package_guidelines#Package_versioning In the case of Nvidia drivers, this often happens whenever there's a new kernel version. You can check under "View changes", for example:
515.43.04-6: linux 5.18.arch1-1
515.43.04-5: linux 5.17.9.arch1-1
NVIDIA Open GPU Kernel Modules Version
515.43.04
Does this happen with the proprietary driver (of the same version) as well?
Yes
Operating System and Version
Arch Linux
Kernel Release
5.18.0-arch1-1
Hardware: GPU
RTX 3070 laptop (System 76 Oryx 8)
Describe the bug
Since upgrading to kernel 5.18, loading the nvidia driver (open or proprietary) fails with the same kernel log:
To Reproduce
Bug Incidence
Always
nvidia-bug-report.log.gz
More Info
Originally I thought this issue had to do with optimus-manager (as I am using a hybrid setup, I use that utility to switch between Intel and Nvidia mode), but after uninstalling optimus-manager the same issue occurs.