clearlinux / distribution

Placeholder repository to allow filing of general bugs/issues/etc against the Clear Linux OS for Intel Architecture linux distribution

Installers don't come up on an AMD CPU / GPU system #796

Open znmeb opened 5 years ago

znmeb commented 5 years ago

I have a 2013-vintage workstation with an AMD processor and an AMD GPU. Specifically:

$ lspci
00:00.0 Host bridge: Advanced Micro Devices, Inc. [AMD/ATI] RD9x0/RX980 Host Bridge (rev 02)
00:02.0 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] RD890/RD9x0/RX980 PCI to PCI bridge (PCI Express GFX port 0)
00:04.0 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] RD890/RD9x0/RX980 PCI to PCI bridge (PCI Express GPP Port 0)
00:05.0 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] RD890/RD9x0/RX980 PCI to PCI bridge (PCI Express GPP Port 1)
00:07.0 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] RD890/RD9x0/RX980 PCI to PCI bridge (PCI Express GPP Port 3)
00:11.0 SATA controller: Advanced Micro Devices, Inc. [AMD/ATI] SB7x0/SB8x0/SB9x0 SATA Controller [AHCI mode] (rev 40)
00:12.0 USB controller: Advanced Micro Devices, Inc. [AMD/ATI] SB7x0/SB8x0/SB9x0 USB OHCI0 Controller
00:12.2 USB controller: Advanced Micro Devices, Inc. [AMD/ATI] SB7x0/SB8x0/SB9x0 USB EHCI Controller
00:13.0 USB controller: Advanced Micro Devices, Inc. [AMD/ATI] SB7x0/SB8x0/SB9x0 USB OHCI0 Controller
00:13.2 USB controller: Advanced Micro Devices, Inc. [AMD/ATI] SB7x0/SB8x0/SB9x0 USB EHCI Controller
00:14.0 SMBus: Advanced Micro Devices, Inc. [AMD/ATI] SBx00 SMBus Controller (rev 42)
00:14.2 Audio device: Advanced Micro Devices, Inc. [AMD/ATI] SBx00 Azalia (Intel HDA) (rev 40)
00:14.3 ISA bridge: Advanced Micro Devices, Inc. [AMD/ATI] SB7x0/SB8x0/SB9x0 LPC host controller (rev 40)
00:14.4 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] SBx00 PCI to PCI Bridge (rev 40)
00:14.5 USB controller: Advanced Micro Devices, Inc. [AMD/ATI] SB7x0/SB8x0/SB9x0 USB OHCI2 Controller
00:16.0 USB controller: Advanced Micro Devices, Inc. [AMD/ATI] SB7x0/SB8x0/SB9x0 USB OHCI0 Controller
00:16.2 USB controller: Advanced Micro Devices, Inc. [AMD/ATI] SB7x0/SB8x0/SB9x0 USB EHCI Controller
00:18.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 15h Processor Function 0
00:18.1 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 15h Processor Function 1
00:18.2 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 15h Processor Function 2
00:18.3 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 15h Processor Function 3
00:18.4 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 15h Processor Function 4
00:18.5 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 15h Processor Function 5
01:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Bonaire XT [Radeon HD 7790/8770 / R7 360 / R9 260/360 OEM]
01:00.1 Audio device: Advanced Micro Devices, Inc. [AMD/ATI] Device 0002
02:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8111/8168/8411 PCI Express Gigabit Ethernet Controller (rev 09)
03:00.0 USB controller: ASMedia Technology Inc. ASM1042A USB 3.0 Host Controller
04:00.0 USB controller: ASMedia Technology Inc. ASM1042A USB 3.0 Host Controller

The live desktop and server ISOs boot up and display what they're supposed to display, but input is frozen: neither the mouse nor the keyboard works. So I can't even do Ctl-Alt-F4 and troubleshoot.

None of the other Linux distros I've tried do this. Sometimes I have to blacklist the amdgpu module on recent kernels, but other than that, they all come up to a live desktop. I tried that with Clear's ISOs and it didn't help.

So it's either something in the Clear Linux kernel or the kernel command line parameters. Is there any way I can troubleshoot this - there's nothing left on the screen to take a picture of.

znmeb commented 5 years ago

Could having the Intel kvm kernel module compiled into the kernel cause this? See https://github.com/clearlinux/distribution/issues/526

lebensterben commented 5 years ago

> Could having the Intel kvm kernel module compiled into the kernel cause this? See https://github.com/clearlinux/distribution/issues/526

@znmeb Just wondering, what type of keyboard and mouse do you have? Wired or wireless? If wired, USB or PS/2? If wireless, is it Bluetooth?

znmeb commented 5 years ago

@lebensterben USB wireless - one of those integrated gizmos from Logitech

Bus 005 Device 002: ID 046d:c52b Logitech, Inc. Unifying Receiver

Here's a list of all the modules loaded (Fedora Silverblue 30):

$ uname -a
Linux AlgoCompSynth 5.0.17-300.fc30.x86_64 #1 SMP Mon May 20 15:36:26 UTC 2019 x86_64 x86_64 x86_64 GNU/Linux

module-check.txt

lebensterben commented 5 years ago

> @lebensterben USB wireless - one of those integrated gizmos from Logitech ...

@znmeb I saw another post whose author says a Logitech mouse doesn't work over a Bluetooth connection but does work with the Unifying Receiver. In your case, though, the problem doesn't seem to be caused by Bluetooth.

znmeb commented 5 years ago

@lebensterben I've successfully installed in a virtual machine - there I can get a Ctl-Alt-F4. But the emulator is emulating a "USB tablet", not a unified wireless receiver. I could try loading the modules in a VM and see if they throw an error.

Screenshot_CkearLinuxBase_2019-05-24_02:52:10

miguelinux commented 5 years ago

Can you test with a wired USB keyboard? If that works, then we need to add the missing drivers.

znmeb commented 5 years ago

I don't have another keyboard. It would be easier for me to build an image with the drivers - which ones are they?

miguelinux commented 5 years ago

> I don't have another keyboard. It would be easier for me to build an image with the drivers - which ones are they?

I don't have the HW, so I cannot test with it and find the missing drivers :-( That's why I was asking for your help to identify the HW once it is booted.

Or, if you have another distro, please paste in the lsmod output from that distro, so I can look for a driver related to your keyboard.

znmeb commented 5 years ago

It's running Silverblue 30 at the moment - the lsmod is attached in https://github.com/clearlinux/distribution/issues/796#issuecomment-495537912

znmeb commented 5 years ago

arcolinux-boot-modules.txt

This is ArcoLinux

Linux ArcoLinux 5.0.10-arch1-1-ARCH #1 SMP PREEMPT Sat Apr 27 20:06:45 UTC 2019 x86_64 GNU/Linux

bwarden commented 5 years ago

> Could having the Intel kvm kernel module compiled into the kernel cause this? See #526

No, the Intel kvm driver is supposed to be well-behaved and politely disappear when no Intel kvm hardware features are detected, leaving the AMD kvm driver to do whatever it wants when it's loaded later.

> please paste-in the lsmod of that distro

@miguelinux can correct me if I'm wrong, but I'll bet he was looking for lsusb

miguelinux commented 5 years ago

from the modules:

we already have

hid_logitech_dj
hid_logitech_hidpp

and hid_logitech_dj is the one for Bus 005 Device 002: ID 046d:c52b Logitech, Inc. Unifying Receiver

$ modinfo hid_logitech_dj
filename:       /lib/modules/5.0.18-767.native/kernel/drivers/hid/hid-logitech-dj.ko
author:         nlopezcasad@logitech.com
author:         Nestor Lopez Casado
author:         Logitech
license:        GPL
alias:          hid:b0003g*v0000046Dp0000C532
alias:          hid:b0003g*v0000046Dp0000C52B
depends:        
retpoline:      Y
intree:         Y
name:           hid_logitech_dj
vermagic:       5.0.18-767.native SMP mod_unload 
sig_id:         PKCS#7
signer:         
sig_key:        
sig_hashalgo:   unknown
signature:      
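The alias lines above are glob patterns that the kernel module tooling compares against a device's modalias string. A minimal sketch of that comparison follows, assuming a hypothetical modalias for the receiver (the `g0001` group field is an assumption, not taken from this thread). On a running system the real string can be read from the `modalias` file under `/sys/bus/hid/devices/`.

```shell
#!/bin/sh
# Return 0 if a device modalias matches a module alias pattern.
# The pattern is used as a shell glob, which is how '*' in the
# modinfo alias lines is meant to be read.
alias_matches() {
    modalias=$1
    pattern=$2
    case "$modalias" in
        $pattern) return 0 ;;
        *) return 1 ;;
    esac
}

# Hypothetical modalias for the Unifying Receiver (046d:c52b).
# The group field "g0001" is an assumption for illustration only.
example_modalias='hid:b0003g0001v0000046Dp0000C52B'

# The two alias patterns reported by modinfo hid_logitech_dj above.
for pattern in 'hid:b0003g*v0000046Dp0000C532' 'hid:b0003g*v0000046Dp0000C52B'; do
    if alias_matches "$example_modalias" "$pattern"; then
        echo "match: $pattern"
    fi
done
```

If the receiver's real modalias matches one of these patterns but the module is still not loaded, the module file is probably not reachable at the time the device is discovered, which is consistent with a module missing from the initrd.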

miguelinux commented 5 years ago

Could you add

modules_load=hid_logitech_dj

to the kernel boot parameters.

miguelinux commented 5 years ago

I think I know what is happening here.

Our new installer uses an initrd, and that initrd does not include the missing modules.

Please try by adding modules_load=hid_logitech_dj
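One way to confirm that the parameter actually reached the kernel, and whether the requested modules ended up loaded, is sketched below. On systemd-based systems `modules_load=` and `rd.modules_load=` are read by systemd-modules-load.service; whether the installer initrd runs that service is not stated in this thread, so treat this as a diagnostic aid rather than a description of how the installer behaves. The function name `requested_modules` is made up for this example.

```shell
#!/bin/sh
# Print the modules requested via modules_load= or rd.modules_load=
# in a kernel command line string, one module name per line.
requested_modules() {
    printf '%s\n' "$1" | tr ' ' '\n' | while IFS= read -r word; do
        case "$word" in
            modules_load=*|rd.modules_load=*)
                # Strip the "key=" prefix and split the comma-separated list.
                printf '%s\n' "${word#*=}" | tr ',' '\n'
                ;;
        esac
    done
}

# Check the running system: a directory under /sys/module means the
# module is currently loaded (some built-in modules also appear there).
cmdline=$(cat /proc/cmdline 2>/dev/null)
for mod in $(requested_modules "$cmdline"); do
    if [ -d "/sys/module/$mod" ]; then
        echo "$mod: present in /sys/module"
    else
        echo "$mod: not loaded"
    fi
done
```

If a module is listed on the command line but reported as not loaded, the parameter was accepted by the kernel but nothing acted on it, or the module file was not available at that point in the boot.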

znmeb commented 5 years ago

@miguelinux I'm away from that machine for a few hours ... it'll be about 22:00 America/Los_Angeles before I am back. I'll try when I get home.

miguelinux commented 5 years ago

@znmeb I found the root cause: some kernel modules are missing from the initrd in the ISO installer.

For now, let's try the workaround of adding modules_load=hid_logitech_dj

QA opened an issue to track this in clr-installer: https://github.com/clearlinux/clr-installer/issues/418

znmeb commented 5 years ago

It didn't work - I tried both modules_load=hid_logitech_dj and modules_load=usbhid,hid_generic,hid_logitech_dj,hid_logitech_hidpp. Those were all loaded in the successful boot of the ArcoLinux D installer.

I booted the installer in a Virtual Machine Manager VM and did find /lib/modules -name '*hid*'. File is attached. hid_generic and usbhid don't seem to be there. Are they compiled into the kernel?

hid_modules.txt

miguelinux commented 5 years ago

> I booted the installer in a Virtual Machine Manager VM and did find /lib/modules -name '*hid*'. File is attached. hid_generic and usbhid don't seem to be there. Are they compiled into the kernel?

yes,

$ grep hid /usr/lib/modules/$(uname -r)/modules.builtin
kernel/drivers/hid/hid.ko
kernel/drivers/hid/hid-generic.ko
kernel/drivers/hid/hid-belkin.ko
kernel/drivers/hid/hid-cherry.ko
kernel/drivers/hid/hid-chicony.ko
kernel/drivers/hid/hid-holtek-kbd.ko
kernel/drivers/hid/hid-holtek-mouse.ko
kernel/drivers/hid/hid-holtekff.ko
kernel/drivers/hid/hid-kensington.ko
kernel/drivers/hid/hid-lenovo.ko
kernel/drivers/hid/hid-logitech.ko
kernel/drivers/hid/hid-microsoft.ko
kernel/drivers/hid/hid-samsung.ko
kernel/drivers/hid/usbhid/usbhid.ko
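The same check can be generalized to tell built-in modules apart from loadable and missing ones. This is a rough sketch that reads `modules.builtin` and `modules.dep` from the module directory. The function name `module_status` is made up for this example, and the handling of compressed module suffixes is an assumption about how the files are named.

```shell
#!/bin/sh
# Classify a module as "builtin", "loadable", or "missing" for a given
# module directory (normally /usr/lib/modules/$(uname -r)).
module_status() {
    name=$1
    moddir=$2
    # Module file names may use '-' where the module name uses '_',
    # so accept either character at each such position.
    pattern=$(printf '%s' "$name" | sed 's/[_-]/[_-]/g')
    if grep -Eq "/${pattern}\.ko\$" "$moddir/modules.builtin" 2>/dev/null; then
        echo builtin
    elif grep -Eq "/${pattern}\.ko(\.[a-z0-9]+)?:" "$moddir/modules.dep" 2>/dev/null; then
        # modules.dep lists each loadable module file followed by ':'.
        echo loadable
    else
        echo missing
    fi
}

# Check the four HID modules discussed in this thread.
for m in hid_generic usbhid hid_logitech_dj hid_logitech_hidpp; do
    echo "$m: $(module_status "$m" "/usr/lib/modules/$(uname -r)")"
done
```

Running this inside the installer environment, rather than on an installed system, shows what the installer itself can load.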

znmeb commented 5 years ago

@miguelinux So the modules_load should have worked. Just out of curiosity is there an older installer without the initrd I could try?

znmeb commented 5 years ago

I have a little news. I did find an old keyboard (wired) that works. So I was able to bring the installer up and CTL-ALT-F4 and log in.

But

  1. modules_load=hid_logitech_dj doesn't load a module. There is probably another way to do it but I couldn't find it on the web anywhere.
  2. When I brought the machine up and did modprobe hid_logitech_dj and modprobe hid_logitech_hidpp the modules loaded but the machine still didn't see the mouse. It wasn't even in lsusb. Is there some udev command required?
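Since the receiver does not show up in lsusb at all, it may help to check USB enumeration separately from the HID drivers: a device has to be enumerated by its USB host controller before any HID module can bind to it. A minimal sketch that looks for the receiver's vendor and product ID in sysfs follows. The function name `usb_device_present` is made up for this example.

```shell
#!/bin/sh
# Return 0 if a USB device with the given vendor:product ID is listed
# under the given sysfs directory (default: /sys/bus/usb/devices).
usb_device_present() {
    vendor=${1%%:*}
    product=${1##*:}
    sysdir=${2:-/sys/bus/usb/devices}
    for dev in "$sysdir"/*; do
        # Interface entries have no idVendor/idProduct files; skip them.
        [ -r "$dev/idVendor" ] && [ -r "$dev/idProduct" ] || continue
        if [ "$(cat "$dev/idVendor")" = "$vendor" ] &&
           [ "$(cat "$dev/idProduct")" = "$product" ]; then
            return 0
        fi
    done
    return 1
}

# 046d:c52b is the Unifying Receiver ID reported earlier in this thread.
if usb_device_present 046d:c52b; then
    echo "receiver is enumerated on the USB bus"
else
    echo "receiver is not enumerated; look at the USB host controller drivers"
fi
```

If the receiver is absent here while the wired keyboard works, comparing which port and host controller each device is plugged into would be a reasonable next step.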

Update: 2019-05-26 18-08-46 PST8PDT - I'm going to be tied up on other projects until June 8 and won't be able to do any more hardware troubleshooting on this issue. I'll still be able to troubleshoot https://github.com/clearlinux/distribution/issues/749 on a Fedora Silverblue 30 host, however.

znmeb commented 5 years ago

I'm back at this for a day or so - here's the latest status:

  1. I have a wired keyboard now, so I can boot the USB drive and log in from the command line via ctl-alt-f4.
  2. It still doesn't see the wireless mouse. I've attached lsmod, journalctl and swupd check-update traces.

swupd-check-update.txt journalctl.txt lsmod.txt

  3. Arch Linux lsmod:
    $ lsmod
    Module                  Size  Used by
    cpufreq_ondemand       16384  8
    isofs                  49152  1
    nf_conntrack_netlink    53248  0
    nfnetlink              16384  2 nf_conntrack_netlink
    xfrm_user              45056  1
    xfrm_algo              16384  1 xfrm_user
    xt_addrtype            16384  2
    br_netfilter           28672  0
    overlay               131072  0
    fuse                  131072  3
    xt_state               16384  0
    nf_nat_h323            24576  0
    nf_conntrack_h323      81920  1 nf_nat_h323
    nf_nat_pptp            20480  0
    nf_conntrack_pptp      20480  1 nf_nat_pptp
    nf_nat_tftp            16384  0
    nf_conntrack_tftp      20480  1 nf_nat_tftp
    nf_nat_sip             20480  0
    nf_conntrack_sip       40960  1 nf_nat_sip
    nf_nat_irc             20480  0
    nf_conntrack_irc       20480  1 nf_nat_irc
    nf_nat_ftp             20480  0
    nf_conntrack_ftp       24576  1 nf_nat_ftp
    xt_CHECKSUM            16384  1
    ipt_MASQUERADE         20480  5
    xt_conntrack           16384  3
    ipt_REJECT             16384  4
    nf_reject_ipv4         16384  1 ipt_REJECT
    xt_tcpudp              20480  10
    ip6table_mangle        16384  1
    ip6table_nat           16384  1
    iptable_mangle         16384  1
    iptable_nat            16384  1
    nf_nat                 45056  9 nf_nat_irc,ip6table_nat,nf_nat_ftp,ipt_MASQUERADE,nf_nat_tftp,nf_nat_pptp,nf_nat_h323,iptable_nat,nf_nat_sip
    nf_conntrack          159744  17 xt_conntrack,nf_nat_irc,nf_nat,nf_conntrack_tftp,nf_nat_ftp,xt_state,nf_conntrack_pptp,ipt_MASQUERADE,nf_nat_tftp,nf_conntrack_sip,nf_conntrack_h323,nf_nat_pptp,nf_conntrack_irc,nf_conntrack_netlink,nf_conntrack_ftp,nf_nat_h323,nf_nat_sip
    nf_defrag_ipv6         24576  1 nf_conntrack
    nf_defrag_ipv4         16384  1 nf_conntrack
    libcrc32c              16384  2 nf_conntrack,nf_nat
    ebtable_filter         16384  0
    ebtables               40960  1 ebtable_filter
    ip6table_filter        16384  1
    ip6_tables             36864  3 ip6table_filter,ip6table_nat,ip6table_mangle
    iptable_filter         16384  1
    tun                    57344  1
    bridge                204800  1 br_netfilter
    stp                    16384  1 bridge
    llc                    16384  2 bridge,stp
    arc4                   16384  2
    nls_iso8859_1          16384  2
    nls_cp437              20480  2
    vfat                   20480  2
    fat                    86016  1 vfat
    amdgpu               3956736  13
    mousedev               24576  0
    edac_mce_amd           32768  0
    kvm_amd               106496  0
    uas                    28672  0
    ccp                    98304  1 kvm_amd
    usb_storage            77824  3 uas
    rng_core               16384  1 ccp
    hid_logitech_hidpp     49152  0
    kvm                   741376  1 kvm_amd
    irqbypass              16384  1 kvm
    rt2800usb              28672  0
    rt2x00usb              24576  1 rt2800usb
    rt2800lib             126976  1 rt2800usb
    rt2x00lib              81920  3 rt2800usb,rt2x00usb,rt2800lib
    8812au               1601536  0
    mac80211              958464  3 rt2x00lib,rt2x00usb,rt2800lib
    crct10dif_pclmul       16384  1
    crc32_pclmul           16384  0
    eeepc_wmi              16384  0
    hid_logitech_dj        24576  0
    asus_wmi               32768  1 eeepc_wmi
    hid_generic            16384  0
    wmi_bmof               16384  0
    mxm_wmi                16384  0
    sparse_keymap          16384  1 asus_wmi
    ghash_clmulni_intel    16384  0
    chash                  16384  1 amdgpu
    gpu_sched              36864  1 amdgpu
    amd_iommu_v2           20480  1 amdgpu
    i2c_algo_bit           16384  1 amdgpu
    ttm                   114688  1 amdgpu
    snd_hda_codec_realtek   126976  1
    drm_kms_helper        212992  1 amdgpu
    snd_hda_codec_generic    90112  1 snd_hda_codec_realtek
    cfg80211              815104  3 rt2x00lib,8812au,mac80211
    snd_hda_codec_hdmi     61440  1
    ledtrig_audio          16384  2 snd_hda_codec_generic,snd_hda_codec_realtek
    usbhid                 65536  0
    snd_hda_intel          49152  4
    hid                   143360  4 usbhid,hid_generic,hid_logitech_dj,hid_logitech_hidpp
    snd_hda_codec         155648  4 snd_hda_codec_generic,snd_hda_codec_hdmi,snd_hda_intel,snd_hda_codec_realtek
    drm                   495616  7 gpu_sched,drm_kms_helper,amdgpu,ttm
    rfkill                 28672  7 asus_wmi,cfg80211
    snd_hda_core          102400  5 snd_hda_codec_generic,snd_hda_codec_hdmi,snd_hda_intel,snd_hda_codec,snd_hda_codec_realtek
    aesni_intel           372736  0
    snd_hwdep              16384  1 snd_hda_codec
    input_leds             16384  0
    snd_pcm               135168  4 snd_hda_codec_hdmi,snd_hda_intel,snd_hda_codec,snd_hda_core
    agpgart                53248  2 ttm,drm
    aes_x86_64             20480  1 aesni_intel
    crypto_simd            16384  1 aesni_intel
    cryptd                 28672  3 crypto_simd,ghash_clmulni_intel,aesni_intel
    glue_helper            16384  1 aesni_intel
    snd_timer              40960  1 snd_pcm
    r8169                  94208  0
    syscopyarea            16384  1 drm_kms_helper
    sysfillrect            16384  1 drm_kms_helper
    snd                    98304  16 snd_hda_codec_generic,snd_hda_codec_hdmi,snd_hwdep,snd_hda_intel,snd_hda_codec,snd_hda_codec_realtek,snd_timer,snd_pcm
    sysimgblt              16384  1 drm_kms_helper
    sp5100_tco             20480  0
    realtek                20480  1
    fb_sys_fops            16384  1 drm_kms_helper
    libphy                 94208  2 r8169,realtek
    soundcore              16384  1 snd
    i2c_piix4              28672  0
    k10temp                16384  0
    fam15h_power           16384  0
    pcc_cpufreq            20480  0
    evdev                  20480  8
    wmi                    32768  3 asus_wmi,wmi_bmof,mxm_wmi
    mac_hid                16384  0
    acpi_cpufreq           28672  0
    usbip_host             36864  0
    usbip_core             36864  1 usbip_host
    msr                    16384  0
    sg                     40960  0
    crypto_user            16384  0
    ip_tables              32768  3 iptable_filter,iptable_nat,iptable_mangle
    x_tables               49152  14 ebtables,ip6table_filter,xt_conntrack,iptable_filter,xt_state,xt_tcpudp,ipt_MASQUERADE,xt_addrtype,xt_CHECKSUM,ip6_tables,ipt_REJECT,ip_tables,ip6table_mangle,iptable_mangle
    ext4                  749568  3
    crc32c_generic         16384  0
    crc16                  16384  1 ext4
    mbcache                16384  1 ext4
    jbd2                  131072  1 ext4
    sd_mod                 57344  11
    ohci_pci               20480  0
    serio_raw              20480  0
    atkbd                  36864  0
    libps2                 20480  1 atkbd
    ahci                   40960  5
    libahci                40960  1 ahci
    libata                274432  2 libahci,ahci
    crc32c_intel           24576  7
    xhci_pci               20480  0
    scsi_mod              249856  5 sd_mod,usb_storage,uas,libata,sg
    xhci_hcd              266240  1 xhci_pci
    ehci_pci               20480  0
    ehci_hcd               98304  1 ehci_pci
    ohci_hcd               57344  1 ohci_pci
    i8042                  32768  0
    serio                  28672  4 serio_raw,atkbd,i8042

If you want a journalctl from Arch I can capture one. Kernel is

$ uname -a
Linux AlgoCompSynth 5.1.11-arch1-1-ARCH #1 SMP PREEMPT Mon Jun 17 18:56:30 UTC 2019 x86_64 GNU/Linux
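With module lists from a working and a non-working boot, a quick comparison can narrow down which drivers are absent on the failing system. A minimal sketch follows, assuming both listings are saved as raw lsmod output. The file name `arch-lsmod.txt` is hypothetical, and `lsmod.txt` stands for the Clear Linux listing attached above.

```shell
#!/bin/sh
# Print the module names (first column) from an lsmod listing, sorted.
# Only lines whose second field is a numeric size are used, which
# skips the "Module Size Used by" header.
module_names() {
    awk '$2 ~ /^[0-9]+$/ { print $1 }' "$1" | LC_ALL=C sort -u
}

# Print modules present in the first listing but absent from the second.
modules_only_in_first() {
    a=$(mktemp)
    b=$(mktemp)
    module_names "$1" > "$a"
    module_names "$2" > "$b"
    LC_ALL=C comm -23 "$a" "$b"
    rm -f "$a" "$b"
}

# Hypothetical file names: the working Arch listing and the failing one.
if [ -f arch-lsmod.txt ] && [ -f lsmod.txt ]; then
    modules_only_in_first arch-lsmod.txt lsmod.txt
fi
```

Modules that are built into the Clear Linux kernel will also appear in this difference, so each result still needs to be checked against modules.builtin.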