pop-os / pop

A project for managing all Pop!_OS sources
https://system76.com/pop
2.47k stars 87 forks source link

Unexpected freeze/hang/crash with pop-shop/io.elementary.appcenter-daemon #2753

Open Ceikry opened 1 year ago

Ceikry commented 1 year ago

Distribution (run cat /etc/os-release):

NAME="Pop!_OS"
VERSION="22.04 LTS"
ID=pop
ID_LIKE="ubuntu debian"
PRETTY_NAME="Pop!_OS 22.04 LTS"
VERSION_ID="22.04"
HOME_URL="https://pop.system76.com"
SUPPORT_URL="https://support.system76.com"
BUG_REPORT_URL="https://github.com/pop-os/pop/issues"
PRIVACY_POLICY_URL="https://system76.com/privacy"
VERSION_CODENAME=jammy
UBUNTU_CODENAME=jammy
LOGO=distributor-logo-pop-os

Related Application and/or Package Version (run apt policy $PACKAGE NAME):

pop-shop:
  Installed: 3.4.2pop0~1666654031~22.04~de9193e
  Candidate: 3.4.2pop0~1666654031~22.04~de9193e
  Version table:
 *** 3.4.2pop0~1666654031~22.04~de9193e 1001
       1001 http://apt.pop-os.org/release jammy/main amd64 Packages
        100 /var/lib/dpkg/status

Issue/Bug Description: Pop shop seems to inexplicably cause a full system hang (with no access to other tty's or anything - fully unresponsive) seemingly at random.

Steps to reproduce (if you know): Unfortunately, it seems to be entirely random.

Expected behavior: For it not to freeze my entire system.

Other Notes:

My entire system hung up while I left it idle to go make a drink. Came back, noticed the system was frozen, and looked at the clock on my PC to see the time it froze at. =

The system froze at exactly 00:14:00.

Rebooted using the power button, and checked journalctl for all logs since 00:10

At exactly 00:13:51, with nothing else in the log before or after, it had these messages about pop-shop:

Jan 08 00:13:51 pop-os io.elementary.appcenter-daemon.desktop[7423]: [40:40:0108/001351.107681:ERROR:shared_image_factory.cc(575)] Could not find SharedImageBackingFactory with params: usage: Gles2|Raste>
Jan 08 00:13:51 pop-os io.elementary.appcenter-daemon.desktop[7423]: [40:40:0108/001351.107791:ERROR:shared_image_factory.cc(575)] Could not find SharedImageBackingFactory with params: usage: Gles2|Raste>

naturally, the only conclusion I can draw is that it was either a cosmic coincidence (get it?), or this was the cause of the indefinite hang.

Neofetch for additional info:

OS: Pop!_OS 22.04 LTS x86_64 
Host: ROG Zephyrus G14 GA401IH_GA401IH 1.0 
Kernel: 6.0.12-76060006-generic 
Uptime: 6 mins 
Packages: 2039 (dpkg), 10 (flatpak) 
Shell: bash 5.1.16 
Resolution: 1920x1080 
DE: GNOME 42.3.1 
WM: Mutter 
WM Theme: Pop 
Theme: Pop-dark [GTK2/3] 
Icons: Pop [GTK2/3] 
Terminal: gnome-terminal 
CPU: AMD Ryzen 7 4800HS with Radeon Graphics (16) @ 2.900GHz
GPU: NVIDIA 01:00.0 NVIDIA Corporation TU117M (GTX 1650)
GPU: AMD ATI 04:00.0 Renoir 
Memory: 2139MiB / 15413MiB 
Ceikry commented 1 year ago

For context, I had a Chromium window with about 7 tabs open, most of them github or something equally benign, with exactly one focused tab playing a youtube video.

This was visible in the background, and my focused app was Discord.

I sent a message to my friend, and then got up and went to grab a drink. When I came back, the system had hung.

I could not switch TTY's, REISUB didn't seem to work, and unfortunately I did not have sshd set up to test with at the time.

I am in Nvidia Graphics mode, not hybrid or integrated.

My nvidia powermizer settings are "Prefer Maximum Performance"

I was set to high performance mode in the System76-Power settings as well.

Ceikry commented 1 year ago

The uptime prior to this crash had been about 6-7 hours, during which time I was interacting with the PC and doing things often enough that one cannot say it was "idle."

Thegrayjedi commented 1 year ago

Ahh, this looks like the processor C-State, which I found a fix for. Is yours a intel or amd processor?

Ceikry commented 1 year ago

AMD

Thegrayjedi commented 1 year ago

Ok, try the following:

sudo kernelstub -a "processor.max_cstate=1"
sudo kernelstub -a "intel_idle.max_cstate=0"

This almost entirely disables the processor's "sleep state", which is a huge issue with AMD processors on linux right now. Give it a few days and see if this clears it up.

Ceikry commented 1 year ago

Alright, I've issued the command. I'll keep this thread updated with the results!

Thegrayjedi commented 1 year ago

Great, reboot and hopefully we don't hear from you again!

Ceikry commented 1 year ago

Unfortunately, that almost certainly did not work. It went from intermittent crashes every few hours, to a consistent crash with no logs at all about 5 minutes after I started any cpu intensive task.

mmstick commented 1 year ago

Are you able to use SSH from another computer to log into this system after it has gotten stuck?

Thegrayjedi commented 1 year ago

Ok, to reverse the kernelstub changes, change the -a to -d in those commands.

Ceikry commented 1 year ago

Are you able to use SSH from another computer to log into this system after it has gotten stuck?

I have not had the chance to try, but I have sshd set up and running now, so if it happens again I'll be able to check.

Ceikry commented 1 year ago

It happened again, and I was unable to connect via SSH. I did test the SSH connection before the crash, and it worked fine, then after the crash it refused to connect, claiming no route to host. the last 2-3 seconds of audio were looping as well.

Ceikry commented 1 year ago

These are the only logs close to the time it happened, and actually occurred merely 1 or 2 seconds before the crash:

Jan 08 10:22:01 pop-os org.chromium.Chromium.desktop[9376]: Fontconfig error: Cannot load default config file: No such file: (null)
Jan 08 10:22:03 pop-os org.chromium.Chromium.desktop[8163]: [1290:2311:0108/102203.814166:ERROR:vaapi_video_decoder.cc(1206)] failed Initialize()ing the frame pool
Jan 08 10:22:03 pop-os org.chromium.Chromium.desktop[8163]: [1290:1290:0108/102203.821010:ERROR:shared_image_factory.cc(575)] Could not find SharedImageBackingFactory with pa>
Jan 08 10:22:03 pop-os org.chromium.Chromium.desktop[8163]: [1290:1290:0108/102203.821214:ERROR:shared_image_factory.cc(575)] Could not find SharedImageBackingFactory with pa>
Jan 08 10:22:03 pop-os org.chromium.Chromium.desktop[8163]: [1290:1290:0108/102203.823059:ERROR:shared_image_factory.cc(575)] Could not find SharedImageBackingFactory with pa>
Jan 08 10:22:03 pop-os org.chromium.Chromium.desktop[8163]: [1290:1290:0108/102203.823225:ERROR:shared_image_factory.cc(575)] Could not find SharedImageBackingFactory with pa>
Jan 08 10:22:03 pop-os org.chromium.Chromium.desktop[8163]: [1290:1290:0108/102203.824842:ERROR:shared_image_factory.cc(575)] Could not find SharedImageBackingFactory with pa>
Jan 08 10:22:03 pop-os org.chromium.Chromium.desktop[8163]: [1290:1290:0108/102203.824974:ERROR:shared_image_factory.cc(575)] Could not find SharedImageBackingFactory with pa>
Jan 08 10:22:03 pop-os org.chromium.Chromium.desktop[8163]: [1290:1290:0108/102203.828563:ERROR:shared_image_factory.cc(575)] Could not find SharedImageBackingFactory with pa>
Jan 08 10:22:03 pop-os org.chromium.Chromium.desktop[8163]: [1290:1290:0108/102203.828698:ERROR:shared_image_factory.cc(575)] Could not find SharedImageBackingFactory with pa>
Jan 08 10:22:03 pop-os org.chromium.Chromium.desktop[8163]: [1290:1290:0108/102203.830345:ERROR:shared_image_factory.cc(575)] Could not find SharedImageBackingFactory with pa>
Jan 08 10:22:03 pop-os org.chromium.Chromium.desktop[8163]: [1290:1290:0108/102203.830454:ERROR:shared_image_factory.cc(575)] Could not find SharedImageBackingFactory with pa>
Thegrayjedi commented 1 year ago

I've been having FontConfig crashes too and I can't figure them out! Driving me nuts.