Closed boenshao closed 1 year ago
Update, I have installed i915/adlp_guc_70.1.1.bin
manually, but the problem persists, with identical error logs, so this is probably a fundamental problem of 5.19 kernel, as I haven't experienced any freeze on 5.18.
I have a similar issue, after kernel update with those "W: Possible missing firmware /lib/firmware/i915/skl_guc_70.1.1.bin for module i915" warnings, my system freezes for 2-3min during the loading screen and after logging on.
Update, Mark from Lenovo probably found the root cause and the i915 patch.
In Pop, I did capture the same min_size
stack trace as mentioned in the Lenovo forum:
Sep 08 00:32:08.434095 pop-os kernel: i915 0000:00:02.0: enabling device (0006 -> 0007)
Sep 08 00:32:08.434148 pop-os kernel: i915 0000:00:02.0: [drm] VT-d active for gfx access
Sep 08 00:32:08.434199 pop-os kernel: i915 0000:00:02.0: vgaarb: deactivate vga console
Sep 08 00:32:08.434236 pop-os kernel: i915 0000:00:02.0: [drm] Transparent Hugepage mode 'huge=within_size'
Sep 08 00:32:08.434241 pop-os kernel: ------------[ cut here ]------------
Sep 08 00:32:08.434246 pop-os kernel: i915 0000:00:02.0: Block 42 min_size is zero
Sep 08 00:32:08.434251 pop-os kernel: WARNING: CPU: 0 PID: 267 at drivers/gpu/drm/i915/display/intel_bios.c:476 init_bdb_block+0x29c/0x490 [i915]
Sep 08 00:32:08.434256 pop-os kernel: Modules linked in: fjes(-) i915(+) drm_buddy i2c_algo_bit ttm drm_display_helper cec rc_core drm_kms_helper hid_generic syscopyarea crct10dif_pclmul >
Sep 08 00:32:08.434262 pop-os kernel: CPU: 0 PID: 267 Comm: systemd-udevd Not tainted 5.19.0-76051900-generic #202207312230~1660780566~22.04~9d60db1
Sep 08 00:32:08.434266 pop-os kernel: Hardware name: LENOVO 21CBCTO1WW/21CBCTO1WW, BIOS N3AET65W (1.30 ) 08/02/2022
Sep 08 00:32:08.434276 pop-os kernel: RIP: 0010:init_bdb_block+0x29c/0x490 [i915]
Sep 08 00:32:08.434280 pop-os kernel: Code: 8b 7b 08 4c 8b 77 50 4d 85 f6 74 29 4c 89 4d d0 e8 99 2c 52 e0 44 89 e1 4c 89 f2 48 c7 c7 88 3d 8e c0 48 89 c6 e8 c0 35 9c e0 <0f> 0b 4c 8b 4d >
Sep 08 00:32:08.434284 pop-os kernel: RSP: 0018:ffffc0dbc0d27888 EFLAGS: 00010282
Sep 08 00:32:08.434289 pop-os kernel: RAX: 0000000000000000 RBX: ffff9d4b21f88000 RCX: ffffffffa257aee8
Sep 08 00:32:08.434293 pop-os kernel: RDX: 0000000000000000 RSI: 0000000000000003 RDI: 0000000000000001
Sep 08 00:32:08.434297 pop-os kernel: RBP: ffffc0dbc0d278d8 R08: 000000000000002c R09: fffffffffffcf9a8
Sep 08 00:32:08.434302 pop-os kernel: R10: 0000000000ffff0a R11: 000000000000002e R12: 000000000000002a
Sep 08 00:32:08.434306 pop-os kernel: R13: 0000000000000000 R14: ffff9d4b029e40f0 R15: 0000000000000000
Sep 08 00:32:08.434310 pop-os kernel: FS: 00007f25ef0458c0(0000) GS:ffff9d523f400000(0000) knlGS:0000000000000000
Sep 08 00:32:08.434314 pop-os kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Sep 08 00:32:08.434318 pop-os kernel: CR2: 000055a4ba07da38 CR3: 000000011df2c004 CR4: 0000000000770ef0
Sep 08 00:32:08.434323 pop-os kernel: PKRU: 55555554
Sep 08 00:32:08.434326 pop-os kernel: Call Trace:
Sep 08 00:32:08.434331 pop-os kernel: <TASK>
Sep 08 00:32:08.434335 pop-os kernel: intel_bios_init+0x15b/0x620 [i915]
Sep 08 00:32:08.434340 pop-os kernel: ? drm_vblank_worker_init+0x68/0x80 [drm]
Sep 08 00:32:08.434344 pop-os kernel: intel_modeset_init_noirq+0x3a/0x260 [i915]
Sep 08 00:32:08.434348 pop-os kernel: i915_driver_probe+0x1a8/0x4a0 [i915]
Sep 08 00:32:08.434352 pop-os kernel: ? drm_privacy_screen_get+0x174/0x190 [drm]
Sep 08 00:32:08.434356 pop-os kernel: ? acpi_dev_found+0x66/0x80
Sep 08 00:32:08.434361 pop-os kernel: i915_pci_probe+0x58/0x140 [i915]
Sep 08 00:32:08.434365 pop-os kernel: local_pci_probe+0x48/0x90
Sep 08 00:32:08.434369 pop-os kernel: pci_call_probe+0x55/0x180
Sep 08 00:32:08.434373 pop-os kernel: pci_device_probe+0x84/0x120
Sep 08 00:32:08.434377 pop-os kernel: really_probe+0x1d3/0x3b0
Sep 08 00:32:08.434382 pop-os kernel: __driver_probe_device+0x119/0x190
Sep 08 00:32:08.434387 pop-os kernel: driver_probe_device+0x23/0xc0
Sep 08 00:32:08.434391 pop-os kernel: __driver_attach+0xbd/0x1e0
Sep 08 00:32:08.434395 pop-os kernel: ? __device_attach_driver+0x120/0x120
Sep 08 00:32:08.434399 pop-os kernel: bus_for_each_dev+0x7c/0xd0
Sep 08 00:32:08.434403 pop-os kernel: driver_attach+0x1e/0x30
Sep 08 00:32:08.434408 pop-os kernel: bus_add_driver+0x178/0x220
Is it possible to pull this patch for next kernel update?
After a recent update, my system no longer freezes.
uname -a
Linux pop-os 5.19.0-76051900-generic #202207312230~1663791054~22.04~28340d4 SMP PREEMPT_DYNAMIC Wed S x86_64 x86_64 x86_64 GNU/Linux
apt reinstall linux-firmware
still produces
W: Possible missing firmware /lib/firmware/i915/skl_guc_70.1.1.bin for module i915
W: Possible missing firmware /lib/firmware/i915/bxt_guc_70.1.1.bin for module i915
W: Possible missing firmware /lib/firmware/i915/kbl_guc_70.1.1.bin for module i915
W: Possible missing firmware /lib/firmware/i915/glk_guc_70.1.1.bin for module i915
W: Possible missing firmware /lib/firmware/i915/kbl_guc_70.1.1.bin for module i915
W: Possible missing firmware /lib/firmware/i915/kbl_guc_70.1.1.bin for module i915
W: Possible missing firmware /lib/firmware/i915/cml_guc_70.1.1.bin for module i915
W: Possible missing firmware /lib/firmware/i915/icl_guc_70.1.1.bin for module i915
W: Possible missing firmware /lib/firmware/i915/ehl_guc_70.1.1.bin for module i915
W: Possible missing firmware /lib/firmware/i915/ehl_guc_70.1.1.bin for module i915
W: Possible missing firmware /lib/firmware/i915/tgl_guc_70.1.1.bin for module i915
W: Possible missing firmware /lib/firmware/i915/tgl_guc_70.1.1.bin for module i915
W: Possible missing firmware /lib/firmware/i915/dg1_guc_70.1.1.bin for module i915
W: Possible missing firmware /lib/firmware/i915/tgl_guc_70.1.1.bin for module i915
W: Possible missing firmware /lib/firmware/i915/adlp_guc_70.1.1.bin for module i915
You can download them manually:
cd /lib/firmware/i915
wget https://anduin.linuxfromscratch.org/sources/linux-firmware/i915/skl_guc_70.1.1.bin
wget https://anduin.linuxfromscratch.org/sources/linux-firmware/i915/bxt_guc_70.1.1.bin
wget https://anduin.linuxfromscratch.org/sources/linux-firmware/i915/kbl_guc_70.1.1.bin
wget https://anduin.linuxfromscratch.org/sources/linux-firmware/i915/glk_guc_70.1.1.bin
wget https://anduin.linuxfromscratch.org/sources/linux-firmware/i915/kbl_guc_70.1.1.bin
wget https://anduin.linuxfromscratch.org/sources/linux-firmware/i915/kbl_guc_70.1.1.bin
wget https://anduin.linuxfromscratch.org/sources/linux-firmware/i915/cml_guc_70.1.1.bin
wget https://anduin.linuxfromscratch.org/sources/linux-firmware/i915/icl_guc_70.1.1.bin
wget https://anduin.linuxfromscratch.org/sources/linux-firmware/i915/ehl_guc_70.1.1.bin
wget https://anduin.linuxfromscratch.org/sources/linux-firmware/i915/ehl_guc_70.1.1.bin
wget https://anduin.linuxfromscratch.org/sources/linux-firmware/i915/tgl_guc_70.1.1.bin
wget https://anduin.linuxfromscratch.org/sources/linux-firmware/i915/tgl_guc_70.1.1.bin
wget https://anduin.linuxfromscratch.org/sources/linux-firmware/i915/dg1_guc_70.1.1.bin
wget https://anduin.linuxfromscratch.org/sources/linux-firmware/i915/tgl_guc_70.1.1.bin
wget https://anduin.linuxfromscratch.org/sources/linux-firmware/i915/adlp_guc_70.1.1.bin
# rebuild initrd images - ubuntu
#apt reinstall linux-firmware
I have seen this same issue and will look into pulling any related patches, as well as updating firmware.
I no longer see Possible missing firmware
on apt reinstall linux-firmware
Still crash after 5.19.16 update, indeed there's big regression in kernel 5.19...
Oct 20 11:00:25.663088 pop-os kernel: BUG: kernel NULL pointer dereference, address: 0000000000000000
Oct 20 11:00:25.663895 pop-os kernel: #PF: supervisor read access in kernel mode
Oct 20 11:00:25.686315 pop-os kernel: #PF: error_code(0x0000) - not-present page
Oct 20 11:00:25.697527 pop-os kernel: PGD 0 P4D 0
Oct 20 11:00:25.709197 pop-os kernel: Oops: 0000 [#1] PREEMPT SMP NOPTI
Oct 20 11:00:25.709277 pop-os kernel: CPU: 12 PID: 643 Comm: kworker/u32:10 Tainted: G W OE 5.19.16-76051916-generic #202210150742~1666053244~22.04~cf07008
Oct 20 11:00:25.709309 pop-os kernel: Hardware name: LENOVO 21CBCTO1WW/21CBCTO1WW, BIOS N3AET65W (1.30 ) 08/02/2022
Oct 20 11:00:25.709338 pop-os kernel: Workqueue: i915 __i915_gem_free_work [i915]
Oct 20 11:00:25.709588 pop-os kernel: RIP: 0010:__rb_erase_color+0x96/0x250
Oct 20 11:00:25.709619 pop-os kernel: Code: 5d 41 5e 41 5f 5d 31 c0 89 c2 89 c1 89 c6 89 c7 c3 cc cc cc cc 4c 89 e2 49 89 c4 4d 8b 6c 24 08 49 39 d5 75 91 4d 8b 6c 24 10 <41> f6 45 00 01 0f 84 f6 00 00 00 49 >
Oct 20 11:00:25.709837 pop-os kernel: RSP: 0018:ffffbbb2c1c23d78 EFLAGS: 00010246
Oct 20 11:00:25.709862 pop-os kernel: RAX: 00000000009a1fff RBX: ffffffffc04bd6c0 RCX: 00000000009a1fff
Oct 20 11:00:25.709886 pop-os kernel: RDX: 0000000000000000 RSI: ffff9c9ca29895e8 RDI: ffff9c9c8116d9c3
Oct 20 11:00:25.710131 pop-os kernel: RBP: ffffbbb2c1c23da0 R08: 0000000000000000 R09: 0000000000000000
Oct 20 11:00:25.710157 pop-os kernel: R10: ffff9c9d497a0b80 R11: 0000000000000000 R12: ffff9c9c8116d9c3
Oct 20 11:00:25.710178 pop-os kernel: R13: 0000000000000000 R14: ffff9c9ca29895e8 R15: ffff9c9c80059000
Oct 20 11:00:25.710404 pop-os kernel: FS: 0000000000000000(0000) GS:ffff9ca3bf700000(0000) knlGS:0000000000000000
Oct 20 11:00:25.710431 pop-os kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Oct 20 11:00:25.710644 pop-os kernel: CR2: 0000000000000000 CR3: 0000000824410006 CR4: 0000000000770ee0
Oct 20 11:00:25.710669 pop-os kernel: PKRU: 55555554
Oct 20 11:00:25.710689 pop-os kernel: Call Trace:
Oct 20 11:00:25.710709 pop-os kernel: <TASK>
Oct 20 11:00:25.710925 pop-os kernel: drm_mm_interval_tree_remove+0x18d/0x2c0 [drm]
Oct 20 11:00:25.710953 pop-os kernel: drm_mm_remove_node+0x39/0xa0 [drm]
Oct 20 11:00:25.710975 pop-os kernel: i915_gem_object_release_stolen+0x3f/0x70 [i915]
Oct 20 11:00:25.711208 pop-os kernel: __i915_gem_free_object+0x4f/0x110 [i915]
Oct 20 11:00:25.711235 pop-os kernel: i915_gem_flush_free_objects+0x7d/0xb0 [i915]
Oct 20 11:00:25.711412 pop-os kernel: __i915_gem_free_work+0x15/0x20 [i915]
Oct 20 11:00:25.711435 pop-os kernel: process_one_work+0x21c/0x400
Oct 20 11:00:25.711457 pop-os kernel: worker_thread+0x50/0x3f0
Oct 20 11:00:25.711669 pop-os kernel: ? rescuer_thread+0x3a0/0x3a0
Oct 20 11:00:25.711695 pop-os kernel: kthread+0xeb/0x120
Oct 20 11:00:25.711715 pop-os kernel: ? kthread_complete_and_exit+0x20/0x20
Oct 20 11:00:25.711929 pop-os kernel: ret_from_fork+0x1f/0x30
Oct 20 11:00:25.711953 pop-os kernel: </TASK>
Oct 20 11:00:25.711977 pop-os kernel: Modules linked in: hidp uhid uas usb_storage typec_displayport tls xt_conntrack nft_chain_nat xt_MASQUERADE nf_nat nf_conntrack_netlink nf_conntrack nf_defrag_ipv6 nf_def>
Oct 20 11:00:25.712242 pop-os kernel: snd_intel_sdw_acpi videobuf2_vmalloc btusb snd_hda_codec videobuf2_memops btrtl pmt_telemetry videobuf2_v4l2 joydev mei_pxp libarc4 btbcm pmt_class intel_rapl_msr snd_se>
Oct 20 11:00:25.712451 pop-os kernel: parport_pc ppdev lp parport ip_tables x_tables autofs4 dm_crypt raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 ra>
Oct 20 11:00:25.712690 pop-os kernel: CR2: 0000000000000000
Oct 20 11:00:25.712716 pop-os kernel: ---[ end trace 0000000000000000 ]---
Oct 20 11:00:25.712940 pop-os kernel: RIP: 0010:__rb_erase_color+0x96/0x250
Oct 20 11:00:25.712960 pop-os kernel: Code: 5d 41 5e 41 5f 5d 31 c0 89 c2 89 c1 89 c6 89 c7 c3 cc cc cc cc 4c 89 e2 49 89 c4 4d 8b 6c 24 08 49 39 d5 75 91 4d 8b 6c 24 10 <41> f6 45 00 01 0f 84 f6 00 00 00 49 >
Oct 20 11:00:25.712984 pop-os kernel: RSP: 0018:ffffbbb2c1c23d78 EFLAGS: 00010246
Oct 20 11:00:25.713005 pop-os kernel: RAX: 00000000009a1fff RBX: ffffffffc04bd6c0 RCX: 00000000009a1fff
Oct 20 11:00:25.713221 pop-os kernel: RDX: 0000000000000000 RSI: ffff9c9ca29895e8 RDI: ffff9c9c8116d9c3
Oct 20 11:00:25.713241 pop-os kernel: RBP: ffffbbb2c1c23da0 R08: 0000000000000000 R09: 0000000000000000
Oct 20 11:00:25.713261 pop-os kernel: R10: ffff9c9d497a0b80 R11: 0000000000000000 R12: ffff9c9c8116d9c3
Oct 20 11:00:25.713278 pop-os kernel: R13: 0000000000000000 R14: ffff9c9ca29895e8 R15: ffff9c9c80059000
Oct 20 11:00:25.713463 pop-os kernel: FS: 0000000000000000(0000) GS:ffff9ca3bf700000(0000) knlGS:0000000000000000
Oct 20 11:00:25.713484 pop-os kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Oct 20 11:00:25.713504 pop-os kernel: CR2: 0000000000000000 CR3: 000000010f74a003 CR4: 0000000000770ee0
Oct 20 11:00:25.713712 pop-os kernel: PKRU: 55555554
No longer encountered any i915 issues on later kernel releases.
Distribution (run
cat /etc/os-release
):Related Application and/or Package Version (run
apt policy $PACKAGE NAME
):Issue/Bug Description: As updating to the 5.19 kernel, below warnings occurred
It seems harmless at first, but I started to get random system freeze, and sometime can't even boot into the system, it's so broken to the point that I couldn't access ttys.
Tracing the log during boot, I found below error:
Some more logs later, the system stopped at:
Steps to reproduce (if you know): Sporadic events, no oblivious steps to reproduce, sorry.
Expected behavior: Loading GuC firmware correctly.
Other Notes: