Closed bkuhls closed 3 months ago
Whilst not identical, as a data-point, I have tested the same microcode on a very similar machine (but running the older 5.8.0-43-generic
kernel from Ubuntu 20.04) and it appeared to boot without issue (or at least enough that I could remotely log in via SSH and capture /proc/cpuinfo - I don't have physical access to this machine unfortunately nor can I easily test other newer kernel versions unfortunately):
processor : 0
vendor_id : GenuineIntel
cpu family : 6
model : 167
model name : 11th Gen Intel(R) Core(TM) i9-11900 @ 2.50GHz
stepping : 1
microcode : 0x59
Can you first boot with cmdline dis_ucode_ldr. This will disable early loading. Then report what version comes from BIOS?
You can also recreate an initrd without the suspect ucode, then try to late-load it via
# echo 1 > /sys/devices/system/cpu/microcode/reload
And report back.
Can you first boot with cmdline dis_ucode_ldr. This will disable early loading. Then report what version comes from BIOS?
# grep microcode /proc/cpuinfo | uniq
microcode : 0x58
This result occurs after a cold boot, board is an ASUS Z590 Extreme with BIOS v2.20 dating 2023/4/7. I updated the BIOS in May, before BIOS v1.90, dating 2021/7/29, was installed. Since May 2022, when I built the machine, I updated the microcode starting with version 20220510, and used all five revisions since then, and never had any problems.
You can also recreate an initrd without the suspect ucode, then try to late-load it via
# echo 1 > /sys/devices/system/cpu/microcode/reload
I need to recompile my self-compiled kernel for this, the subdirectory microcode/ is not present atm, and will report back.
Since I am chain-loading two initrd files, one of them containing the microcode, using syslinux there is no need to recreate the initrd, I just obmit the microcode image from the kernel cmdline.
EDIT: A reboot without dis_ucode_ldr was enough ;) Here is the result of the microcode reload:
kern.info kernel: microcode: updated to revision 0x59, date = 2023-02-26
kern.info kernel: microcode: Reload completed, microcode revision: 0x59
Approx. 10s later the machine freezes without any messages on the terminal (headless server here) or in syslog.
The issue still exists with version 20231114.
Thanks.. I'll have someone look into that... I didn't fully understand the linked initrd.. Are you saying that you have two entries. one with the ucode in initrd and another without the ucode?
So after the late-load, you noticed system updated from 0x58->0x59.. but system froze after that. Did I understand it correctly
I didn't fully understand the linked initrd.. Are you saying that you have two entries. one with the ucode in initrd and another without the ucode?
The bootloader contains one entry consisting of two initrd files which are being chain-loaded. The first initrd file is created by iucode_tool --write-earlyfw, the second initrd file contains the linux system.
So after the late-load, you noticed system updated from 0x58->0x59.. but system froze after that. Did I understand it correctly
Yes.
The bug is fixed with version 20240813, thanks!
# dmesg | grep sig=
microcode: sig=0xa0671, pf=0x2, revision=0x62
# cat /proc/cpuinfo
processor : 0
vendor_id : GenuineIntel
cpu family : 6
model : 167
model name : 11th Gen Intel(R) Core(TM) i9-11900T @ 1.50GHz
stepping : 1
microcode : 0x62
It seems my machine suffered from erratum TGL068 "Processor May Hang During a Microcode Update"
Thanks for the update!
Uh, TGL068 == ADL075 == ICL088. Hopefully all of them are fixed already in 20240813...
Uh, TGL068 == ADL075 == ICL088. Hopefully all of them are fixed already in 20240813...
Yes, that is correct. The changes for this issue were included in the May and Aug updates.
With version 20230512 everything is fine but using the latest version 20230808 the system hangs during boot.
The img file is used with syslinux
which works fine with the previous microcode
but with the current microcode on screen only three lines appear about loading intel-microcode.img, rootfs.img and the Linux kernel (5.14.21) itself, no messages from the kernel itself appear, the system is frozen.