QubesOS / qubes-issues

The Qubes OS Project issue tracker
https://www.qubes-os.org/doc/issue-tracking/
534 stars 47 forks source link

Display/system freezes every now and then (i915 0000:00:02.0: vblank wait timed out on crtc 1) #9239

Open emanruse opened 4 months ago

emanruse commented 4 months ago

Qubes OS release

4.2.1

Brief summary

While working (e.g. web browsing, typing email to be sent etc) the display freezes and the system becomes unresponsive. The only way out is hardware reset.

Steps to reproduce

No specific.

Expected behavior

A correctly working system.

Actual behavior

Explained in summary.

Additional info:

Looking at the journal after reboot, there are these lines, just before the reboot:

[20179.416272] dom0 kernel: ------------[ cut here ]------------
[20179.416281] dom0 kernel: i915 0000:00:02.0: vblank wait timed out on crtc 1
[20179.416325] dom0 kernel: WARNING: CPU: 1 PID: 5703 at drivers/gpu/drm/drm_vblank.c:1310 drm_wait_one_vblank+0x1f5/0x210
[20179.416346] dom0 kernel: Modules linked in: snd_seq_dummy snd_hrtimer vfat fat snd_sof_pci_intel_cnl snd_sof_intel_hda_common soundwire_intel snd_sof_intel_hda_mlink soundwire_cadence snd_sof_intel_hda snd_sof_pci snd_sof_xtensa_dsp snd_sof snd_sof_utils soundwire_generic_allocation soundwire_bus snd_soc_skl snd_soc_hdac_hda snd_hda_ext_core snd_soc_sst_ipc snd_soc_sst_dsp snd_hda_codec_hdmi snd_soc_acpi_intel_match snd_soc_acpi snd_hda_codec_realtek snd_hda_codec_generic ledtrig_audio snd_soc_core snd_compress ac97_bus snd_pcm_dmaengine snd_hda_intel iTCO_wdt ee1004 intel_pmc_bxt snd_intel_dspcfg iTCO_vendor_support snd_intel_sdw_acpi intel_powerclamp intel_rapl_msr snd_hda_codec joydev snd_hda_core apple_mfi_fastcharge i2c_i801 snd_hwdep snd_seq i2c_smbus snd_seq_device pcspkr snd_pcm processor_thermal_device_pci_legacy processor_thermal_device processor_thermal_rfim snd_timer processor_thermal_mbox processor_thermal_rapl intel_pmc_core snd intel_rapl_common soundcore int340x_thermal_zone r8169 intel_soc_dts_iosf
[20179.416448] dom0 kernel:  intel_pch_thermal fuse loop xenfs dm_thin_pool dm_persistent_data dm_bio_prison dm_crypt crct10dif_pclmul crc32_pclmul crc32c_intel polyval_clmulni polyval_generic ghash_clmulni_intel sha512_ssse3 hid_apple i915 sha256_ssse3 sha1_ssse3 nvme nvme_core nvme_common i2c_algo_bit drm_buddy video xhci_pci wmi xhci_pci_renesas ttm pinctrl_cannonlake drm_display_helper xhci_hcd cec xen_acpi_processor xen_privcmd xen_pciback xen_blkback xen_gntalloc xen_gntdev xen_evtchn scsi_dh_rdac scsi_dh_emc scsi_dh_alua uinput dm_multipath
[20179.416521] dom0 kernel: CPU: 1 PID: 5703 Comm: Xorg Not tainted 6.6.25-1.qubes.fc37.x86_64 #1
...

which (including the rest of the trace) are almost identical to the ones shown in this bug report:

https://gitlab.freedesktop.org/drm/i915/kernel/-/issues/8685

wp07e commented 4 months ago

I am facing a similar issue (not sure if its the same thing). Since about a week ago after performing an update using the Qubes OS Update manager (I am not sure which Qubes where updates but I am sure Dom0 was updated as well), my system is now constantly freezing and it is random. It could happen after 10 mins after reboot or it could happen after 3 hours but it always happens and when it freezes, the only way to resolve is a hard reboot.

Its coming to a point now where this system is becoming unusable for my daily tasks. Prior to these freezing issues, I have left my desktop running non-stop for months at a time without any freezing and a need to reboot. Since about a week ago, I must now reboot daily.

Edit: Next time I will try running journalctl and see if I can find any errors. Edit1: I checked journalctl on reboot there are no errors or warnings in Dom0, Fedora, Whonix. What the hell is going on? It still happening I must reboot every few hours after the system freezes.

emanruse commented 4 months ago

Too bad.

emanruse commented 4 months ago

I make 100 times more money than you you gimp

???

wp07e commented 4 months ago

Developers I cannot believe this is not a P0 show stopper issue! Within every hour, my computer completely freezes and I am forced to do a hard reboot. There are no warnings or errors in DOM0 using jounalctl. Is there any other way I can debug this issue?

wp07e commented 4 months ago

@emanruse I deleted that message so not sure how you got it but if you cannot help me out please just don't say anything not useful I am not in a good mood right now and losing productivity

marmarek commented 4 months ago

@walkerp07 the issue you linked includes two solutions:

Have you tried any of those?

wp07e commented 4 months ago

@marmarek thank you for your reply. Initially, i had kernel 6.6.29 but I have upgraded dom0 to kernel 6.8.8.1.

I have also followed the instructions here: https://askubuntu.com/questions/1500411/my-laptop-freezes-when-connecting-external-monitor-via-dp-or-hdmi to add the flag GRUB_CMDLINE_LINUX_DEFAULT="i915.dmc_firmware_path= quiet splash"as you have mentioned.

I will report back

wp07e commented 4 months ago

@marmarek no it didn't work. As soon as I opened firefox and started streaming a video, about 2 minutes later the whole computer froze again. It happens sometimes when I am streaming video (i.e. youtube) and sometimes when I am not. I ran firefox on a fedora 38 based Qube. Any other suggestions?

I notice my Fedora qube is still running 6.6.29-1. Do I need to upgrade this kernel as well? If I go to Qube manager -> Fedora 38 -> settings -> Advanced, kernel 6.8 is not listed there. Do you know why?

wp07e commented 4 months ago

@marmarek The system also crashed when I reverted back to 6.6.25 but it lasted a day after reverting back to 6.1.75.

So I have hope here but if you can tell me how to push the latest 6.8 to my other qubes I would appreciate it!

Edit: Computer hasen't crashed after two days non-stop using kernel 6.1.75 on Qubes I use daily with Dom0 still using 6.8. I am 75% confident this is a stable solution (albeit I would rather be running 6.8 on the Qubes as well but its not listed for some reason)

apparatius commented 4 months ago

So I have hope here but if you can tell me how to push the latest 6.8 to my other qubes I would appreciate it!

You need to install kernel-latest-qubes-vm package in dom0: https://www.qubes-os.org/doc/how-to-install-software-in-dom0/#domu

wp07e commented 4 months ago

@apparatius Oh crap! I missed that thank you!

wp07e commented 4 months ago

Hey, I finally had a chance to reboot with all Qubes (Dom0, user Qubes) to 6.8 and my computer froze again.

It seems like my system is stable with certain Qubes running 6.1.75 only. I am trying to figure out exactly what qube needs to be in 6.1.75.

Is anyone else experiencing freezing events due to kernel changes?

wp07e commented 4 months ago

I have all of my Qubes running on 6.8 except the primary Qube that I use for most of my daily tasks. This Qube (which is based on fedora-38-xfce template, using sys-firewall for NetVM) seems to be the one that requires kernel 6.1.75 otherwise the system crashes. I know that because I have switched the kernel and my system has not crashed for over 6 hours (which would be a record had the kernel not been changed).

So... why this Qube? That is the $100 dollar question. If anyone has any insights please let me know.

wp07e commented 4 months ago

PLEASE HELP! My computer is starting to randomly freeze again even with the kernels at 6.1.75. If it is not the kernel could it be hardware related?

I have a newer gaming rig so really doubt its the hardware but is there anyway I can check? Can anyone please help?

andrewdavidwong commented 4 months ago

@walkerp07: Please note that this issue tracker (qubes-issues) is not intended to serve as a help desk or tech support center. Instead, we've set up other venues where you can ask for help and support, ask questions, and have discussions. (By contrast, the issue tracker is more of a technical tool intended to support our developers in their work.) Thank you for your understanding.

wp07e commented 4 months ago

@andrewdavidwong ok I will reach out there thank you.

PetrVladimirov commented 3 months ago

I am facing a similar issue (not sure if its the same thing). Since about a week ago after performing an update using the Qubes OS Update manager (I am not sure which Qubes where updates but I am sure Dom0 was updated as well), my system is now constantly freezing and it is random. It could happen after 10 mins after reboot or it could happen after 3 hours but it always happens and when it freezes, the only way to resolve is a hard reboot.

Its coming to a point now where this system is becoming unusable for my daily tasks. Prior to these freezing issues, I have left my desktop running non-stop for months at a time without any freezing and a need to reboot. Since about a week ago, I must now reboot daily.

Edit: Next time I will try running journalctl and see if I can find any errors. Edit1: I checked journalctl on reboot there are no errors or warnings in Dom0, Fedora, Whonix. What the hell is going on? It still happening I must reboot every few hours after the system freezes.

Exactly the same behavior. No logs in Dom0, even several minutes prior the freeze. Has to power off and start the machine manually. The system is with i915 and used with an external HDMI display, but I've not yet tried the workarounds mentioned above.

wp07e commented 2 months ago

@PetrVladimirov My system is stable ONLY if I run DOM0 and all other Qubes on 6.6.25-1. No other kernel versions work for more than 2 hours without freezing no matter what the combination is between Dom0 and Qubes. It even freezes with the latest kernel 6.9.4-1.

Praying for the day when I can upgrade / change the kenel again

emanruse commented 2 months ago

I haven't seen this happening for a long time. 6.6.31-1 here.

FWIW, while it was happening, this system was with 1/2 of its physical RAM (a total of 32G). After re-adding another RAM module (total 64), it seems the problem stopped appearing.