Open samjcarter opened 2 years ago
Original issue post will be closed (fix available).
Close this too, please!
I mentioned this in more detail over in https://github.com/fedora-silverblue/issue-tracker/issues/286#issuecomment-1163972288, but I don't believe the fix mentioned by @ivanvorstanenko is related to this bug report, and I don't believe this ticket should be closed.
I get a similar situation as described in 6.11, except I don't have to run the disable_threads script. Just running the install command again after the initial failure proceeds with the install, so I have to run the install command twice for everything. Luckily I don't use the ostree installer often.
1. Links to other posts of this issue
Initial discussion with @travier and @jbirch-atlassian on Fedora Silverblue GitHub issue #286 suggested that a combination of contributing systems couldn't be ruled out, and that some exploration between them would likely be needed.
Therefore I'm reposting the issue on the following issue/bug trackers:
2. Contents
Links to other posts of this issue
Contents
Description of the issue
To Reproduce
Expected behavior
Screenshots / Terminal Output
OS version: (Host System Details)
Would you like to work on this issue?
3. Description of issue
I have two workstations with Fedora 36 Silverblue. One of them has a 6 core, 12 thread Intel cpu, and the other, a 16 core 32 thread AMD cpu. Both use Nvidia graphics cards.
On the 16 core machine only, attempting to install the nvidia drivers with `rpm-ostree install akmod-nvidia xorg-x11-drv-nvidia-cuda`, and every subsequent use of `rpm-ostree install` with any other package, fails with an error. Before attempting to install the nvidia drivers, other packages installed with `rpm-ostree install package` succeed without errors.

I can work around the error (and stop it from showing) by using a short bash script to disable some of the CPU cores on the 16 core computer. I must run the script before every use of `rpm-ostree install`. The error never occurs when carrying out the same steps on the 6 core computer. The only difference is that the 6 core computer has an older graphics card, and so must `rpm-ostree install akmod-nvidia-470xx xorg-x11-drv-nvidia-470xx-cuda` instead.
4. To Reproduce
Please describe the steps needed to reproduce the bug:
- `rpm-ostree update`
- `rpm-ostree install htop` (htop as an example)
- `systemctl reboot`.
- `sudo rpm-ostree install https://mirrors.rpmfusion.org/free/fedora/rpmfusion-free-release-$(rpm -E %fedora).noarch.rpm https://mirrors.rpmfusion.org/nonfree/fedora/rpmfusion-nonfree-release-$(rpm -E %fedora).noarch.rpm`. Detailed instructions at https://rpmfusion.org/Configuration
- `sudo rpm-ostree install akmod-nvidia xorg-x11-drv-nvidia-cuda`
- Create `disable-threads.sh` and make it executable: `chmod +x disable-threads.sh`.
- `sudo ./disable-threads.sh false 9 31`. (confirm in System Monitor Resources tab)
- `rpm-ostree install package` and it should complete without the error.
- `sudo rpm-ostree kargs --append=rd.driver.blacklist=nouveau --append=modprobe.blacklist=nouveau --append=nvidia-drm.modeset=1`
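The contents of `disable-threads.sh` are not reproduced in this post. As a rough illustration only, a script of this kind could toggle the kernel's per-CPU `online` files under `/sys/devices/system/cpu`. The function name `set_cpu_range`, the `CPU_SYSFS_ROOT` override, and the meaning of the three arguments (state, first CPU, last CPU) are assumptions made for this sketch, inferred from the `false 9 31` invocation above, and may not match the real script.

```shell
#!/usr/bin/env bash
# Hypothetical sketch of a disable-threads.sh-style helper. The original
# script is not shown in this issue, so the argument handling is assumed.

# Root of the CPU sysfs tree. Overridable (an assumption of this sketch) so
# the function can be tried against a scratch directory first.
CPU_SYSFS_ROOT="${CPU_SYSFS_ROOT:-/sys/devices/system/cpu}"

# set_cpu_range STATE FIRST LAST
#   STATE: "true" brings logical CPUs online, "false" takes them offline.
#   FIRST, LAST: inclusive range of logical CPU numbers.
set_cpu_range() {
    local state="$1" first="$2" last="$3" value cpu file
    case "$state" in
        true)  value=1 ;;
        false) value=0 ;;
        *) echo "usage: set_cpu_range true|false FIRST LAST" >&2; return 2 ;;
    esac
    # Reject non-numeric bounds before looping.
    if ! [[ "$first" =~ ^[0-9]+$ && "$last" =~ ^[0-9]+$ ]]; then
        echo "FIRST and LAST must be non-negative integers" >&2
        return 2
    fi
    for ((cpu = first; cpu <= last; cpu++)); do
        file="$CPU_SYSFS_ROOT/cpu$cpu/online"
        # Some CPUs (commonly cpu0) expose no "online" file; skip those.
        if [[ -w "$file" ]]; then
            echo "$value" > "$file"
        else
            echo "skipping cpu$cpu: $file not writable" >&2
        fi
    done
}
```

Run as root, `set_cpu_range false 9 31` would take logical CPUs 9 through 31 offline, and `set_cpu_range true 9 31` would bring them back. The change does not persist across reboots, which would be consistent with having to rerun the script.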
5. Expected behavior
No error should interrupt the nvidia driver install on the 16 core computer. All subsequent uses of `rpm-ostree install package` should also not fail with the same error. The behaviour of the 16 core computer should match the 6 core computer, where the error never appears.
6. Screenshots / Terminal Output
6.1 Fresh Fedora 36 Silverblue install with updates done
6.2 Install a layered package, e.g. htop
6.3 Check rpm-ostree status
Reboot first: `systemctl reboot`
6.4 Remove HTOP again
rpm-ostree uninstall htop
reboot and check it's gone:
Htop successfully removed.
6.5 Add rpmfusion repositories (successful)
6.6 Attempt Nvidia driver install (Fails with Error)
6.7 Output of `journalctl -t 'rpm-ostree(akmod-nvidia.post)'`
6.8 Run our disable-threads.sh script
sudo ./disable-threads.sh false 9 31
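Besides the System Monitor Resources tab, the set of online logical CPUs can also be read in a terminal from the kernel's `/sys/devices/system/cpu/online` file, which uses a comma-separated range format such as `0-3,8,10-11`. The helper below is a sketch for counting the CPUs in such a list. Its name, `count_cpus_in_list`, is made up for this example and is not part of any tool mentioned in this issue.

```shell
#!/usr/bin/env bash
# count_cpus_in_list LIST
#   LIST uses the kernel's CPU list format, e.g. "0-8" or "0-3,8,10-11".
#   Prints how many logical CPUs the list covers.
count_cpus_in_list() {
    local list="$1" part lo hi count=0
    local IFS=','            # split the list on commas only
    for part in $list; do
        if [[ "$part" == *-* ]]; then
            lo="${part%-*}"  # text before the dash
            hi="${part#*-}"  # text after the dash
            count=$((count + hi - lo + 1))
        elif [[ -n "$part" ]]; then
            count=$((count + 1))
        fi
    done
    echo "$count"
}

# Example on a live system:
#   count_cpus_in_list "$(cat /sys/devices/system/cpu/online)"
```

If CPUs 9 to 31 are offline on the 32 thread machine, the file would be expected to contain `0-8`, which the function counts as 9.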
6.9 Attempt Nvidia driver install second time (success)
6.10 Reboot and check status
systemctl reboot
6.11 Try to install HTOP again; without running the disable-threads.sh script (Fails with error)
6.12 Output from `journalctl -t 'rpm-ostree(akmod-nvidia.post)'`
The journalctl log continues for approximately 8500 lines listing similar output: `/lib/modules/...kernel...needs... "something": /lib/modules/...`
At this point, I have saved a copy of all the log items: `journalctl -t 'rpm-ostree(akmod-nvidia.post)' > journalctl.txt`.
6.13 Try to install HTOP again; AFTER running the disable-threads.sh script (succeeds)
sudo ./disable-threads.sh false 9 31
6.14 Check rpm-ostree status
7. OS version: (Host System Details)
8. Would you like to work on this issue?
No, I don't have the skillset.