nktice / AMD-AI

AMD (Radeon GPU) ROCm based setup for popular AI tools on Ubuntu 22.04 / 23.04 / 23.10 / 24.04

Any word on Ubuntu 23.10? #1

kyancey closed this issue 6 months ago

kyancey commented 6 months ago

I followed your guide for 23.04 in the hope that it would work. However, I got stuck at installing amdgpu-dkms.

Today, however, the system offered me a software update through the GNOME GUI software updater, and amdgpu-dkms was listed. So I tried it, and it installed.

My hope was premature though. While it installed, it doesn't seem to be working.

sudo /opt/rocm/bin/rocminfo gives me this:

ROCk module is NOT loaded, possibly no GPU devices

This is my dpkg.log. (Obviously, only the most recently installed stuff.)

2023-12-06 07:52:51 install linux-modules-6.5.0-14-generic:amd64 6.5.0-14.14
2023-12-06 07:52:52 install linux-image-6.5.0-14-generic:amd64 6.5.0-14.14
2023-12-06 07:52:52 install linux-modules-extra-6.5.0-14-generic:amd64 6.5.0-14.14
2023-12-06 07:52:55 install linux-headers-6.5.0-14:all 6.5.0-14.14
2023-12-06 07:53:02 install linux-headers-6.5.0-14-generic:amd64 6.5.0-14.14
2023-12-09 22:45:20 install dkms:all 3.0.11-1ubuntu10 3.0.11-1ubuntu10
2023-12-09 22:45:20 install amdgpu-dkms-firmware:all 1:6.2.4.50700-1652687.22.04
2023-12-09 22:45:20 install amdgpu-dkms:all 1:6.2.4.50700-1652687.22.04 1:6.2.4.50700-1652687.22.04
2023-12-17 11:21:25 install rocm-dkms:amd64 5.7.1.50701-98~22.04
2023-12-17 11:24:31 install nvtop:amd64 3.0.2-1
2023-12-17 11:24:38 install rovclock:amd64 0.6e-7build1
2023-12-17 11:24:38 install radeontop:amd64 1.4-2
2023-12-17 11:27:48 install python3-pip-whl:all 23.2+dfsg-1ubuntu0.1
2023-12-17 11:27:48 install python3-setuptools-whl:all 68.1.2-2
2023-12-17 11:27:48 install python3.11-venv:amd64 3.11.6-3
2023-12-17 11:27:48 install python3-venv:amd64 3.11.4-5

I'd value any thoughts or insights you might have. I realize it's a long shot. This is the curse of using the bleeding edge.

nktice commented 6 months ago

ROCm 6.0 is out... So I've updated the guides to use the new driver. It also appears that this version works for me with Ubuntu 23.10.
https://github.com/nktice/AMD-AI/blob/main/ROCm6.0.md

kyancey commented 6 months ago

Cool. It still doesn't seem to work for me on the 7900XTX, but thanks for the effort. I tried uninstalling and reinstalling everything. Not sure what's the matter, but I got the same result. It's probably something I previously installed. I'll keep trying different things.

By the way, on this part:

ROCm...

sudo apt install -y rocm-dev rocm-libs rocm-hip-sdk rocm-dkms rocm-libs

sudo apt install -y rocm-opencl rocm-opencl-dev

sudo apt install -y hipsparse hipblas hipblas-dev hipcub

sudo apt isntall -y rocblas rocblas-dev rccl rocthrust roctracer-dev

The last line has a typo. It says isntall instead of install. I know it's commented out, but I thought you'd want to know.

nktice commented 6 months ago

Are you installing on a fresh system install, or on an upgrade? What entries are in /var/log/dmesg for your video card?
What do you get when you run the following terminal command?

apt list | grep amdgpu-dkms

kyancey commented 6 months ago

It's a fresh install. I didn't upgrade from a previous Ubuntu version. I worried that going through your installation steps before had installed something I didn't need. So I went back and uninstalled everything. I then went through the steps again. The problem doesn't seem to be that it's not installed. I get this from the command above:

amdgpu-dkms-firmware/jammy,now 1:6.3.6.60000-1697589.22.04 all [installed,automatic]
amdgpu-dkms-headers/jammy 1:6.3.6.60000-1697589.22.04 all
amdgpu-dkms/jammy,now 1:6.3.6.60000-1697589.22.04 all [installed]

The problem is that despite everything seemingly being installed, I get this when I run /opt/rocm/bin/rocminfo:

ROCk module is NOT loaded, possibly no GPU devices

I also ran lsmod | grep amdgpu and got nothing. So I simply think the kernel module isn't loaded.

Does amdgpu-dkms install a binary? Or do I have to build the module from source?

I also ran dkms status:

amdgpu/6.3.6-1697589.22.04, 6.5.0-14-generic, x86_64: installed
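
A DKMS status of "installed" only means the module was built and copied into place for that kernel; it does not mean the kernel accepted it at load time. As a minimal sketch of the check described above (the helper name `module_loaded` is made up for illustration, and the follow-up commands are suggestions rather than steps from the guide), the loaded state can be read straight from /proc/modules:

```shell
#!/bin/sh
# Report whether a kernel module is currently loaded.
# $1 = module name, $2 = modules table to read (defaults to /proc/modules).
# module_loaded is a hypothetical helper name, not part of any package.
module_loaded() {
    # The first field of each /proc/modules line is the module name.
    awk -v m="$1" '$1 == m { found = 1 } END { exit !found }' "${2:-/proc/modules}"
}

if module_loaded amdgpu; then
    echo "amdgpu is loaded"
else
    echo "amdgpu is NOT loaded"
    # Things to try by hand next (both need root):
    #   sudo modprobe amdgpu          # prints the load error, if there is one
    #   sudo dmesg | grep -i amdgpu   # kernel messages mentioning the driver
fi
```

If the module is built but not loaded, the error printed by a manual modprobe is usually the quickest pointer to the cause.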

nktice commented 6 months ago

Are you running on raw hardware, or under some sort of virtualization (such as WSL)? What do you get when you run

uname -a

kyancey commented 6 months ago

Linux y60-ubuntu 6.5.0-14-generic #14-Ubuntu SMP PREEMPT_DYNAMIC Tue Nov 14 14:59:49 UTC 2023 x86_64 x86_64 x86_64 GNU/Linux

Yes, I'm running on raw hardware. My specs:

AMD Ryzen™ 9 7900X Processor
32 GB [16 GB X2] DDR5-5600MHz XPG Lancer RGB
AMD Radeon RX 7900 XTX - 24GB GDDR6 - XFX SPEEDSTER MERC310
MSI PRO B650-P WIFI - WIFI6E, ARGB Header (2), USB 3.2 Ports (1 Type-C, 3 Type-A), M.2 Slot (2)
2TB Kingston Fury Renegade M.2 PCIe Gen 4 NVMe SSD

grep 'amd' /proc/modules

edac_mce_amd 40960 0 - Live 0x0000000000000000
kvm_amd 208896 0 - Live 0x0000000000000000
kvm 1404928 1 kvm_amd, Live 0x0000000000000000
ccp 135168 1 kvm_amd, Live 0x0000000000000000
gpio_amdpt 16384 0 - Live 0x0000000000000000

ls /lib/modules/

6.5.0-10-generic 6.5.0-13-generic 6.5.0-14-generic

So I haven't upgraded from Ubuntu 22.10, but I did do a few updates since installing 23.10.

In the 14-generic folder, this is what cat modules.alias | grep amd returns:

alias cpu:type:x86,venfammod:feature:00C2 kvm_amd alias pci:v00001022d0000746Bsvsdbcsci gpio_amd8111 alias platform:gpio_amd_fch gpio_amd_fch alias acpi:AMDIF031: gpio_amdpt alias acpi:AMDIF030: gpio_amdpt alias acpi:AMDF030: gpio_amdpt alias pci:v00001022d0000746Bsvsdbcsci amd_rng alias pci:v00001022d00007443svsdbcsci amd_rng alias pci:v00001022d00002092svsdbcsci pata_amd alias pci:v00001022d0000209Asvsdbcsci pata_amd alias pci:v000010DEd00000759svsdbcsci pata_amd alias pci:v000010DEd0000056Csvsdbcsci pata_amd alias pci:v000010DEd00000560svsdbcsci pata_amd alias pci:v000010DEd00000448svsdbcsci pata_amd alias pci:v000010DEd000003ECsvsdbcsci pata_amd alias pci:v000010DEd0000036Esvsdbcsci pata_amd alias pci:v000010DEd00000265svsdbcsci pata_amd alias pci:v000010DEd00000035svsdbcsci pata_amd alias pci:v000010DEd00000053svsdbcsci pata_amd alias pci:v000010DEd000000E5svsdbcsci pata_amd alias pci:v000010DEd000000D5svsdbcsci pata_amd alias pci:v000010DEd00000085svsdbcsci pata_amd alias pci:v000010DEd00000065svsdbcsci pata_amd alias pci:v000010DEd000001BCsvsdbcsci pata_amd alias pci:v00001022d00007469svsdbcsci pata_amd alias pci:v00001022d00007441svsdbcsci pata_amd alias pci:v00001022d00007411svsdbcsci pata_amd alias pci:v00001022d00007409svsdbcsci pata_amd alias pci:v00001022d00007401svsdbcsci pata_amd alias of:NTCamd,pensando-elba-spiC spi_dw_mmio alias of:NTCamd,pensando-elba-spi spi_dw_mmio alias acpi:AMDI0062: spi_amd alias acpi:AMDI0061: spi_amd alias mdio:0000000000100010010101100001???? 
amd alias pci:v00001022d00007462sv00000000sd00000000bcsci amd8111e alias pci:v00001022d00001459svsdbcsci amd_xgbe alias pci:v00001022d00001458svsdbcsci amd_xgbe alias acpi:AMDI8001: amd_xgbe alias pci:v00001022d00002096svsdbc0Csc03iFE amd5536udc_pci alias pci:v000010DEd000001B4svsdbcsci i2c_amd756 alias pci:v00001022d0000746Bsvsdbcsci i2c_amd756 alias pci:v00001022d00007443svsdbcsci i2c_amd756 alias pci:v00001022d00007413svsdbcsci i2c_amd756 alias pci:v00001022d0000740Bsvsdbcsci i2c_amd756 alias pci:v00001022d0000746Asvsdbcsci i2c_amd8111 alias pci:v00001022d000015E6svsdbcsci i2c_amd_mp2_pci alias acpi:AMDI0011: i2c_amd_mp2_plat alias of:NTCamd,sbtsiC sbtsi_temp alias of:NTCamd,sbtsi sbtsi_temp alias of:NTCamd,sbrmiC sbrmi alias of:NTCamd,sbrmi sbrmi alias edac-mce-amd edac_mce_amd alias cpu:type:x86,ven0002fam0019mod:feature: amd64_edac alias cpu:type:x86,ven0009fam0018mod:feature: amd64_edac alias cpu:type:x86,ven0002fam0017mod:feature: amd64_edac alias cpu:type:x86,ven0002fam0016mod:feature: amd64_edac alias cpu:type:x86,ven0002fam0015mod:feature: amd64_edac alias cpu:type:x86,ven0002fam0010mod:feature: amd64_edac alias cpu:type:x86,ven0002fam000Fmod:feature: amd64_edac alias cpu:type:x86,venfammod:feature:00E9 amd_freq_sensitivity alias pci:v00001022d0000164Asvsdbcsci amd_sfh alias pci:v00001022d000015E4svsdbcsci amd_sfh alias acpi:AMD0005: amd_pmc alias acpi:AMD0004: amd_pmc alias acpi:AMDI000A: amd_pmc alias acpi:AMDI0009: amd_pmc alias acpi:AMDI0008: amd_pmc alias acpi:AMDI0007: amd_pmc alias acpi:AMDI0006: amd_pmc alias acpi:AMDI0005: amd_pmc alias acpi:AMDI0103: amd_pmf alias acpi:AMDI0102: amd_pmf alias acpi:AMDI0100: amd_pmf alias pci:v00001022d00007468sv00000000sd00000000bcsci amd76xrom alias pci:v00001022d00007440svsdbcsci amd76xrom alias pci:v00001022d00007410svsdbcsci amd76xrom alias platform:amd_sdw_manager soundwire_amd alias platform:amd_ps_sdw_dma snd_ps_sdw_dma alias pci:v00001022d000015E2svsdbcsci snd_sof_amd_renoir alias 
pci:v00001022d000015E2svsdbcsci snd_sof_amd_rembrandt alias pci:v00001002dsvsdbc12sc00i00 amdgpu alias pci:v00001002dsvsdbc03sc80i00 amdgpu alias pci:v00001002dsvsdbc03sc00i00 amdgpu alias pci:v00001002d0000743Fsvsdbcsci amdgpu alias pci:v00001002d00007424svsdbcsci amdgpu alias pci:v00001002d00007423svsdbcsci amdgpu alias pci:v00001002d00007422svsdbcsci amdgpu alias pci:v00001002d00007421svsdbcsci amdgpu alias pci:v00001002d00007420svsdbcsci amdgpu alias pci:v00001002d0000143Fsvsdbcsci amdgpu alias pci:v00001002d000013FEsvsdbcsci amdgpu alias pci:v00001002d00007410svsdbcsci amdgpu alias pci:v00001002d0000740Fsvsdbcsci amdgpu alias pci:v00001002d0000740Csvsdbcsci amdgpu alias pci:v00001002d00007408svsdbcsci amdgpu alias pci:v00001002d000073FFsvsdbcsci amdgpu alias pci:v00001002d000073EFsvsdbcsci amdgpu alias pci:v00001002d000073EDsvsdbcsci amdgpu alias pci:v00001002d000073ECsvsdbcsci amdgpu alias pci:v00001002d000073EBsvsdbcsci amdgpu alias pci:v00001002d000073EAsvsdbcsci amdgpu alias pci:v00001002d000073E9svsdbcsci amdgpu alias pci:v00001002d000073E8svsdbcsci amdgpu alias pci:v00001002d000073E3svsdbcsci amdgpu alias pci:v00001002d000073E2svsdbcsci amdgpu alias pci:v00001002d000073E1svsdbcsci amdgpu alias pci:v00001002d000073E0svsdbcsci amdgpu alias pci:v00001002d000073DFsvsdbcsci amdgpu alias pci:v00001002d000073DEsvsdbcsci amdgpu alias pci:v00001002d000073DDsvsdbcsci amdgpu alias pci:v00001002d000073DCsvsdbcsci amdgpu alias pci:v00001002d000073DBsvsdbcsci amdgpu alias pci:v00001002d000073DAsvsdbcsci amdgpu alias pci:v00001002d000073C3svsdbcsci amdgpu alias pci:v00001002d000073C1svsdbcsci amdgpu alias pci:v00001002d000073C0svsdbcsci amdgpu alias pci:v00001002d00001681svsdbcsci amdgpu alias pci:v00001002d0000164Dsvsdbcsci amdgpu alias pci:v00001002d000073BFsvsdbcsci amdgpu alias pci:v00001002d000073AFsvsdbcsci amdgpu alias pci:v00001002d000073AEsvsdbcsci amdgpu alias pci:v00001002d000073ADsvsdbcsci amdgpu alias pci:v00001002d000073ACsvsdbcsci amdgpu alias 
pci:v00001002d000073ABsvsdbcsci amdgpu alias pci:v00001002d000073A9svsdbcsci amdgpu alias pci:v00001002d000073A8svsdbcsci amdgpu alias pci:v00001002d000073A5svsdbcsci amdgpu alias pci:v00001002d000073A3svsdbcsci amdgpu alias pci:v00001002d000073A2svsdbcsci amdgpu alias pci:v00001002d000073A1svsdbcsci amdgpu alias pci:v00001002d000073A0svsdbcsci amdgpu alias pci:v00001002d00007362svsdbcsci amdgpu alias pci:v00001002d00007360svsdbcsci amdgpu alias pci:v00001002d0000164Csvsdbcsci amdgpu alias pci:v00001002d00001638svsdbcsci amdgpu alias pci:v00001002d00001636svsdbcsci amdgpu alias pci:v00001002d000015E7svsdbcsci amdgpu alias pci:v00001002d0000734Fsvsdbcsci amdgpu alias pci:v00001002d00007347svsdbcsci amdgpu alias pci:v00001002d00007341svsdbcsci amdgpu alias pci:v00001002d00007340svsdbcsci amdgpu alias pci:v00001002d0000731Fsvsdbcsci amdgpu alias pci:v00001002d0000731Esvsdbcsci amdgpu alias pci:v00001002d0000731Bsvsdbcsci amdgpu alias pci:v00001002d0000731Asvsdbcsci amdgpu alias pci:v00001002d00007319svsdbcsci amdgpu alias pci:v00001002d00007318svsdbcsci amdgpu alias pci:v00001002d00007312svsdbcsci amdgpu alias pci:v00001002d00007310svsdbcsci amdgpu alias pci:v00001002d00007390svsdbcsci amdgpu alias pci:v00001002d0000738Esvsdbcsci amdgpu alias pci:v00001002d00007388svsdbcsci amdgpu alias pci:v00001002d0000738Csvsdbcsci amdgpu alias pci:v00001002d000015D8svsdbcsci amdgpu alias pci:v00001002d000015DDsvsdbcsci amdgpu alias pci:v00001002d000066AFsvsdbcsci amdgpu alias pci:v00001002d000066A7svsdbcsci amdgpu alias pci:v00001002d000066A4svsdbcsci amdgpu alias pci:v00001002d000066A3svsdbcsci amdgpu alias pci:v00001002d000066A2svsdbcsci amdgpu alias pci:v00001002d000066A1svsdbcsci amdgpu alias pci:v00001002d000066A0svsdbcsci amdgpu alias pci:v00001002d000069AFsvsdbcsci amdgpu alias pci:v00001002d000069A3svsdbcsci amdgpu alias pci:v00001002d000069A2svsdbcsci amdgpu alias pci:v00001002d000069A1svsdbcsci amdgpu alias pci:v00001002d000069A0svsdbcsci amdgpu alias 
pci:v00001002d0000687Fsvsdbcsci amdgpu alias pci:v00001002d0000686Fsvsdbcsci amdgpu alias pci:v00001002d0000686Esvsdbcsci amdgpu alias pci:v00001002d0000686Dsvsdbcsci amdgpu alias pci:v00001002d0000686Csvsdbcsci amdgpu alias pci:v00001002d0000686Bsvsdbcsci amdgpu alias pci:v00001002d0000686Asvsdbcsci amdgpu alias pci:v00001002d00006869svsdbcsci amdgpu alias pci:v00001002d00006868svsdbcsci amdgpu alias pci:v00001002d00006867svsdbcsci amdgpu alias pci:v00001002d00006864svsdbcsci amdgpu alias pci:v00001002d00006863svsdbcsci amdgpu alias pci:v00001002d00006862svsdbcsci amdgpu alias pci:v00001002d00006861svsdbcsci amdgpu alias pci:v00001002d00006860svsdbcsci amdgpu alias pci:v00001002d0000694Fsvsdbcsci amdgpu alias pci:v00001002d0000694Esvsdbcsci amdgpu alias pci:v00001002d0000694Csvsdbcsci amdgpu alias pci:v00001002d0000699Fsvsdbcsci amdgpu alias pci:v00001002d00006997svsdbcsci amdgpu alias pci:v00001002d00006995svsdbcsci amdgpu alias pci:v00001002d00006987svsdbcsci amdgpu alias pci:v00001002d00006986svsdbcsci amdgpu alias pci:v00001002d00006985svsdbcsci amdgpu alias pci:v00001002d00006981svsdbcsci amdgpu alias pci:v00001002d00006980svsdbcsci amdgpu alias pci:v00001002d00006FDFsvsdbcsci amdgpu alias pci:v00001002d000067CFsvsdbcsci amdgpu alias pci:v00001002d000067CCsvsdbcsci amdgpu alias pci:v00001002d000067CAsvsdbcsci amdgpu alias pci:v00001002d000067C9svsdbcsci amdgpu alias pci:v00001002d000067C8svsdbcsci amdgpu alias pci:v00001002d000067DFsvsdbcsci amdgpu alias pci:v00001002d000067D0svsdbcsci amdgpu alias pci:v00001002d000067C7svsdbcsci amdgpu alias pci:v00001002d000067C4svsdbcsci amdgpu alias pci:v00001002d000067C2svsdbcsci amdgpu alias pci:v00001002d000067C1svsdbcsci amdgpu alias pci:v00001002d000067C0svsdbcsci amdgpu alias pci:v00001002d000067E9svsdbcsci amdgpu alias pci:v00001002d000067E7svsdbcsci amdgpu alias pci:v00001002d000067E1svsdbcsci amdgpu alias pci:v00001002d000067FFsvsdbcsci amdgpu alias pci:v00001002d000067EFsvsdbcsci amdgpu alias 
pci:v00001002d000067EBsvsdbcsci amdgpu alias pci:v00001002d000067E8svsdbcsci amdgpu alias pci:v00001002d000067E3svsdbcsci amdgpu alias pci:v00001002d000067E0svsdbcsci amdgpu alias pci:v00001002d000098E4svsdbcsci amdgpu alias pci:v00001002d00009877svsdbcsci amdgpu alias pci:v00001002d00009876svsdbcsci amdgpu alias pci:v00001002d00009875svsdbcsci amdgpu alias pci:v00001002d00009874svsdbcsci amdgpu alias pci:v00001002d00009870svsdbcsci amdgpu alias pci:v00001002d0000730Fsvsdbcsci amdgpu alias pci:v00001002d00007300svsdbcsci amdgpu alias pci:v00001002d00006939svsdbcsci amdgpu alias pci:v00001002d00006938svsdbcsci amdgpu alias pci:v00001002d00006930svsdbcsci amdgpu alias pci:v00001002d0000692Fsvsdbcsci amdgpu alias pci:v00001002d0000692Bsvsdbcsci amdgpu alias pci:v00001002d00006929svsdbcsci amdgpu alias pci:v00001002d00006928svsdbcsci amdgpu alias pci:v00001002d00006921svsdbcsci amdgpu alias pci:v00001002d00006920svsdbcsci amdgpu alias pci:v00001002d00006907svsdbcsci amdgpu alias pci:v00001002d00006903svsdbcsci amdgpu alias pci:v00001002d00006902svsdbcsci amdgpu alias pci:v00001002d00006901svsdbcsci amdgpu alias pci:v00001002d00006900svsdbcsci amdgpu alias pci:v00001002d0000985Fsvsdbcsci amdgpu alias pci:v00001002d0000985Esvsdbcsci amdgpu alias pci:v00001002d0000985Dsvsdbcsci amdgpu alias pci:v00001002d0000985Csvsdbcsci amdgpu alias pci:v00001002d0000985Bsvsdbcsci amdgpu alias pci:v00001002d0000985Asvsdbcsci amdgpu alias pci:v00001002d00009859svsdbcsci amdgpu alias pci:v00001002d00009858svsdbcsci amdgpu alias pci:v00001002d00009857svsdbcsci amdgpu alias pci:v00001002d00009856svsdbcsci amdgpu alias pci:v00001002d00009855svsdbcsci amdgpu alias pci:v00001002d00009854svsdbcsci amdgpu alias pci:v00001002d00009853svsdbcsci amdgpu alias pci:v00001002d00009852svsdbcsci amdgpu alias pci:v00001002d00009851svsdbcsci amdgpu alias pci:v00001002d00009850svsdbcsci amdgpu alias pci:v00001002d0000983Fsvsdbcsci amdgpu alias pci:v00001002d0000983Esvsdbcsci amdgpu alias 
pci:v00001002d0000983Dsvsdbcsci amdgpu alias pci:v00001002d0000983Csvsdbcsci amdgpu alias pci:v00001002d0000983Bsvsdbcsci amdgpu alias pci:v00001002d0000983Asvsdbcsci amdgpu alias pci:v00001002d00009839svsdbcsci amdgpu alias pci:v00001002d00009838svsdbcsci amdgpu alias pci:v00001002d00009837svsdbcsci amdgpu alias pci:v00001002d00009836svsdbcsci amdgpu alias pci:v00001002d00009835svsdbcsci amdgpu alias pci:v00001002d00009834svsdbcsci amdgpu alias pci:v00001002d00009833svsdbcsci amdgpu alias pci:v00001002d00009832svsdbcsci amdgpu alias pci:v00001002d00009831svsdbcsci amdgpu alias pci:v00001002d00009830svsdbcsci amdgpu alias pci:v00001002d000067BEsvsdbcsci amdgpu alias pci:v00001002d000067BAsvsdbcsci amdgpu alias pci:v00001002d000067B9svsdbcsci amdgpu alias pci:v00001002d000067B8svsdbcsci amdgpu alias pci:v00001002d000067B1svsdbcsci amdgpu alias pci:v00001002d000067B0svsdbcsci amdgpu alias pci:v00001002d000067AAsvsdbcsci amdgpu alias pci:v00001002d000067A9svsdbcsci amdgpu alias pci:v00001002d000067A8svsdbcsci amdgpu alias pci:v00001002d000067A2svsdbcsci amdgpu alias pci:v00001002d000067A1svsdbcsci amdgpu alias pci:v00001002d000067A0svsdbcsci amdgpu alias pci:v00001002d0000665Fsvsdbcsci amdgpu alias pci:v00001002d0000665Dsvsdbcsci amdgpu alias pci:v00001002d0000665Csvsdbcsci amdgpu alias pci:v00001002d00006658svsdbcsci amdgpu alias pci:v00001002d00006651svsdbcsci amdgpu alias pci:v00001002d00006650svsdbcsci amdgpu alias pci:v00001002d00006649svsdbcsci amdgpu alias pci:v00001002d00006647svsdbcsci amdgpu alias pci:v00001002d00006646svsdbcsci amdgpu alias pci:v00001002d00006641svsdbcsci amdgpu alias pci:v00001002d00006640svsdbcsci amdgpu alias pci:v00001002d0000131Dsvsdbcsci amdgpu alias pci:v00001002d0000131Csvsdbcsci amdgpu alias pci:v00001002d0000131Bsvsdbcsci amdgpu alias pci:v00001002d00001318svsdbcsci amdgpu alias pci:v00001002d00001317svsdbcsci amdgpu alias pci:v00001002d00001316svsdbcsci amdgpu alias pci:v00001002d00001315svsdbcsci amdgpu alias 
pci:v00001002d00001313svsdbcsci amdgpu alias pci:v00001002d00001312svsdbcsci amdgpu alias pci:v00001002d00001311svsdbcsci amdgpu alias pci:v00001002d00001310svsdbcsci amdgpu alias pci:v00001002d0000130Fsvsdbcsci amdgpu alias pci:v00001002d0000130Esvsdbcsci amdgpu alias pci:v00001002d0000130Dsvsdbcsci amdgpu alias pci:v00001002d0000130Csvsdbcsci amdgpu alias pci:v00001002d0000130Bsvsdbcsci amdgpu alias pci:v00001002d0000130Asvsdbcsci amdgpu alias pci:v00001002d00001309svsdbcsci amdgpu alias pci:v00001002d00001307svsdbcsci amdgpu alias pci:v00001002d00001306svsdbcsci amdgpu alias pci:v00001002d00001305svsdbcsci amdgpu alias pci:v00001002d00001304svsdbcsci amdgpu alias pci:v00001002d0000666Fsvsdbcsci amdgpu alias pci:v00001002d00006667svsdbcsci amdgpu alias pci:v00001002d00006665svsdbcsci amdgpu alias pci:v00001002d00006664svsdbcsci amdgpu alias pci:v00001002d00006663svsdbcsci amdgpu alias pci:v00001002d00006660svsdbcsci amdgpu alias pci:v00001002d0000683Fsvsdbcsci amdgpu alias pci:v00001002d0000683Dsvsdbcsci amdgpu alias pci:v00001002d0000683Bsvsdbcsci amdgpu alias pci:v00001002d00006839svsdbcsci amdgpu alias pci:v00001002d00006838svsdbcsci amdgpu alias pci:v00001002d00006837svsdbcsci amdgpu alias pci:v00001002d00006835svsdbcsci amdgpu alias pci:v00001002d00006831svsdbcsci amdgpu alias pci:v00001002d00006830svsdbcsci amdgpu alias pci:v00001002d0000682Fsvsdbcsci amdgpu alias pci:v00001002d0000682Dsvsdbcsci amdgpu alias pci:v00001002d0000682Csvsdbcsci amdgpu alias pci:v00001002d0000682Bsvsdbcsci amdgpu alias pci:v00001002d0000682Asvsdbcsci amdgpu alias pci:v00001002d00006829svsdbcsci amdgpu alias pci:v00001002d00006828svsdbcsci amdgpu alias pci:v00001002d00006827svsdbcsci amdgpu alias pci:v00001002d00006826svsdbcsci amdgpu alias pci:v00001002d00006825svsdbcsci amdgpu alias pci:v00001002d00006824svsdbcsci amdgpu alias pci:v00001002d00006823svsdbcsci amdgpu alias pci:v00001002d00006822svsdbcsci amdgpu alias pci:v00001002d00006821svsdbcsci amdgpu alias 
pci:v00001002d00006820svsdbcsci amdgpu alias pci:v00001002d00006631svsdbcsci amdgpu alias pci:v00001002d00006623svsdbcsci amdgpu alias pci:v00001002d00006621svsdbcsci amdgpu alias pci:v00001002d00006620svsdbcsci amdgpu alias pci:v00001002d00006617svsdbcsci amdgpu alias pci:v00001002d00006613svsdbcsci amdgpu alias pci:v00001002d00006611svsdbcsci amdgpu alias pci:v00001002d00006610svsdbcsci amdgpu alias pci:v00001002d00006608svsdbcsci amdgpu alias pci:v00001002d00006607svsdbcsci amdgpu alias pci:v00001002d00006606svsdbcsci amdgpu alias pci:v00001002d00006605svsdbcsci amdgpu alias pci:v00001002d00006604svsdbcsci amdgpu alias pci:v00001002d00006603svsdbcsci amdgpu alias pci:v00001002d00006602svsdbcsci amdgpu alias pci:v00001002d00006601svsdbcsci amdgpu alias pci:v00001002d00006600svsdbcsci amdgpu alias pci:v00001002d00006819svsdbcsci amdgpu alias pci:v00001002d00006818svsdbcsci amdgpu alias pci:v00001002d00006817svsdbcsci amdgpu alias pci:v00001002d00006816svsdbcsci amdgpu alias pci:v00001002d00006811svsdbcsci amdgpu alias pci:v00001002d00006810svsdbcsci amdgpu alias pci:v00001002d00006809svsdbcsci amdgpu alias pci:v00001002d00006808svsdbcsci amdgpu alias pci:v00001002d00006806svsdbcsci amdgpu alias pci:v00001002d00006802svsdbcsci amdgpu alias pci:v00001002d00006801svsdbcsci amdgpu alias pci:v00001002d00006800svsdbcsci amdgpu alias pci:v00001002d0000679Fsvsdbcsci amdgpu alias pci:v00001002d0000679Esvsdbcsci amdgpu alias pci:v00001002d0000679Bsvsdbcsci amdgpu alias pci:v00001002d0000679Asvsdbcsci amdgpu alias pci:v00001002d00006799svsdbcsci amdgpu alias pci:v00001002d00006798svsdbcsci amdgpu alias pci:v00001002d00006792svsdbcsci amdgpu alias pci:v00001002d00006791svsdbcsci amdgpu alias pci:v00001002d00006790svsdbcsci amdgpu alias pci:v00001002d0000678Asvsdbcsci amdgpu alias pci:v00001002d00006788svsdbcsci amdgpu alias pci:v00001002d00006784svsdbcsci amdgpu alias pci:v00001002d00006780svsdbcsci amdgpu

updates/dkms has this:

amddrm_buddy.ko.zst amdgpu.ko.zst amd-sched.ko.zst amdxcp.ko.zst amddrm_ttm_helper.ko.zst amdkcl.ko.zst amdttm.ko.zst

kyancey commented 6 months ago

lspci returns

03:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Navi 31 [Radeon RX 7900 XT/7900 XTX] (rev c8) (prog-if 00 [VGA controller])
    Subsystem: XFX Limited RX-79XMERCB9 [SPEEDSTER MERC 310 RX 7900 XTX]
    Flags: bus master, fast devsel, latency 0, IRQ 255, IOMMU group 15
    Memory at f000000000 (64-bit, prefetchable) [size=32G]
    Memory at f800000000 (64-bit, prefetchable) [size=256M]
    I/O ports at f000 [disabled] [size=256]
    Memory at fca00000 (32-bit, non-prefetchable) [size=1M]
    Expansion ROM at fcb00000 [disabled] [size=128K]
    Capabilities: [48] Vendor Specific Information: Len=08 <?>
    Capabilities: [50] Power Management version 3
    Capabilities: [64] Express Legacy Endpoint, MSI 00
    Capabilities: [a0] MSI: Enable- Count=1/1 Maskable- 64bit+
    Capabilities: [100] Vendor Specific Information: ID=0001 Rev=1 Len=010 <?>
    Capabilities: [150] Advanced Error Reporting
    Capabilities: [200] Physical Resizable BAR
    Capabilities: [240] Power Budgeting <?>
    Capabilities: [270] Secondary PCI Express
    Capabilities: [2a0] Access Control Services
    Capabilities: [2d0] Process Address Space ID (PASID)
    Capabilities: [320] Latency Tolerance Reporting
    Capabilities: [410] Physical Layer 16.0 GT/s <?>
    Capabilities: [450] Lane Margining at the Receiver <?>
    Kernel modules: amdgpu

13:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Raphael (rev c2) (prog-if 00 [VGA controller])
    Subsystem: Micro-Star International Co., Ltd. [MSI] Raphael
    Flags: bus master, fast devsel, latency 0, IRQ 255, IOMMU group 31
    Memory at f820000000 (64-bit, prefetchable) [size=256M]
    Memory at f830000000 (64-bit, prefetchable) [size=2M]
    I/O ports at d000 [disabled] [size=256]
    Memory at fc900000 (32-bit, non-prefetchable) [size=512K]
    Capabilities: [48] Vendor Specific Information: Len=08 <?>
    Capabilities: [50] Power Management version 3
    Capabilities: [64] Express Legacy Endpoint, MSI 00
    Capabilities: [a0] MSI: Enable- Count=1/4 Maskable- 64bit+
    Capabilities: [c0] MSI-X: Enable- Count=4 Masked-
    Capabilities: [100] Vendor Specific Information: ID=0001 Rev=1 Len=010 <?>
    Capabilities: [270] Secondary PCI Express
    Capabilities: [2a0] Access Control Services
    Capabilities: [2b0] Address Translation Service (ATS)
    Capabilities: [2c0] Page Request Interface (PRI)
    Capabilities: [2d0] Process Address Space ID (PASID)
    Capabilities: [410] Physical Layer 16.0 GT/s <?>
    Capabilities: [450] Lane Margining at the Receiver <?>
    Kernel modules: amdgpu
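
Both devices in the lspci output above list "Kernel modules: amdgpu" but neither has a "Kernel driver in use:" line, which is what lspci prints once a driver has actually bound to the device. The sketch below (the function name `unbound_gpus` is hypothetical, and the sample input is abbreviated from the output above) filters lspci text for graphics devices in that state:

```shell
#!/bin/sh
# Read `lspci -k` (or `lspci -v`) text on stdin and print the header line of
# every graphics device that has no "Kernel driver in use:" entry.
# unbound_gpus is a made-up helper name used only for this illustration.
unbound_gpus() {
    awk '
        # Print the pending device header if it was a GPU with no bound driver.
        function report() { if (dev != "" && !bound) print dev }
        /^[0-9a-fA-F]/ {            # unindented line: a new device begins
            report()
            dev = ($0 ~ /VGA compatible controller|Display controller|3D controller/) ? $0 : ""
            bound = 0
            next
        }
        /Kernel driver in use:/ { bound = 1 }
        END { report() }
    '
}

# Sample input, abbreviated from the lspci output in this thread.
unbound_gpus <<'EOF'
03:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Navi 31 [Radeon RX 7900 XT/7900 XTX] (rev c8)
    Subsystem: XFX Limited RX-79XMERCB9 [SPEEDSTER MERC 310 RX 7900 XTX]
    Kernel modules: amdgpu
EOF
```

On a live system the same filter would be fed with `lspci -k | unbound_gpus`; no output means every graphics device has a driver bound.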

nktice commented 6 months ago

Have you searched the forums for similar problems? Here, for example, are other settings to try for a similar issue:
https://askubuntu.com/questions/1427868/amdgpu-drivers-not-loading-on-xubuntu-22-04

Your most recent post with the lspci output appears to show two VGA controllers, so I'm wondering: does your motherboard have built-in video? If so, the two may be conflicting. I'd try disabling the built-in video in the BIOS and checking the settings there.

kyancey commented 6 months ago

I went back and disabled the integrated GPU, just to be certain, but it still doesn't appear to work. From what I can gather, it is using the 7900 XTX, but only as a VGA device. I've uninstalled and reinstalled the drivers several times, after changing BIOS settings, but I just don't think it's going to work for me. Thanks for all your help, but I think I'm going to call it quits.

kyancey commented 6 months ago

Okay, I fixed it right after I gave up. The problem was that Secure Boot was enabled in the BIOS, which prevented the graphics driver from loading. I haven't tested everything yet, but my info is showing up in rocminfo, and I was able to get my display to 4K. Thanks. I think this will be the last of it.
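
For anyone who lands here with the same symptom, the Secure Boot state can be checked from a terminal before rebooting into the firmware setup. This is a rough sketch, assuming the mokutil tool is installed and that `mokutil --sb-state` reports "SecureBoot enabled" or "SecureBoot disabled"; the helper name `sb_state` is made up for illustration:

```shell
#!/bin/sh
# Classify the text printed by `mokutil --sb-state`.
# sb_state is a hypothetical helper; the matched strings are assumed to be
# what mokutil prints on the system at hand.
sb_state() {
    case "$1" in
        *"SecureBoot enabled"*)  echo enabled ;;
        *"SecureBoot disabled"*) echo disabled ;;
        *)                       echo unknown ;;
    esac
}

if command -v mokutil >/dev/null 2>&1; then
    state=$(sb_state "$(mokutil --sb-state 2>/dev/null)")
else
    state=unknown   # mokutil is not installed, so the state cannot be read here
fi

echo "Secure Boot: $state"
if [ "$state" = enabled ]; then
    echo "The kernel may refuse DKMS-built modules such as amdgpu unless their signing key is enrolled."
fi
```

With Secure Boot enabled, the options are to disable it in the firmware setup, as was done in this thread, or to enroll the key that signs the DKMS modules.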