Closed cognitivetech closed 6 months ago
While memtest_vulkan tries to use simplest GPU commands to avoid situation "GPU problems lead to hang before logging errors" - this is not always possible - somtimes errors appear "atomically" - it is "all working then completely hang".
The checkerboard pattern during hang almost always means hardware problems. Sometimes those can be solved by under-clocking GPU and memory (start with a extreme undercloking of both GPU ans memory to find if it helps at all; if it helps find, then find a max stable clokcs) .
Adding Option "Coolbits" "28"
line into xorg.conf enables underclocking RTX 30x0 via nvidia-settings GUI.
Underclocking this way allows achieving smallest clocks for testing purposes, but is not compatible with wayland.
I had no experience/success with other methods mentined in Arch wiki, amybe some of them works fine.
I'll convert this to a card-specific didcussion since this is not a memtest_vulkan problem.
log output nothing unusual:
I must reboot to restore function. Mostly this card is working fine, but using this test, and under random occasion with heavy load getting this problem, but I can't diagnose.