Open Ceikry opened 1 year ago
For context, I had a Chromium window with about 7 tabs open, most of them github or something equally benign, with exactly one focused tab playing a youtube video.
This was visible in the background, and my focused app was Discord.
I sent a message to my friend, and then got up and went to grab a drink. When I came back, the system had hung.
I could not switch TTY's, REISUB didn't seem to work, and unfortunately I did not have sshd set up to test with at the time.
I am in Nvidia Graphics mode, not hybrid or integrated.
My nvidia powermizer settings are "Prefer Maximum Performance"
I was set to high performance mode in the System76-Power settings as well.
The uptime prior to this crash had been about 6-7 hours, during which time I was interacting with the PC and doing things often enough that one cannot say it was "idle."
Ahh, this looks like the processor C-State, which I found a fix for. Is yours a intel or amd processor?
AMD
Ok, try the following:
sudo kernelstub -a "processor.max_cstate=1"
sudo kernelstub -a "intel_idle.max_cstate=0"
This almost entirely disables the processor's "sleep state", which is a huge issue with AMD processors on linux right now. Give it a few days and see if this clears it up.
Alright, I've issued the command. I'll keep this thread updated with the results!
Great, reboot and hopefully we don't hear from you again!
Unfortunately, that almost certainly did not work. It went from intermittent crashes every few hours, to a consistent crash with no logs at all about 5 minutes after I started any cpu intensive task.
Are you able to use SSH from another computer to log into this system after it has gotten stuck?
Ok, to reverse the kernelstub changes, change the -a to -d in those commands.
Are you able to use SSH from another computer to log into this system after it has gotten stuck?
I have not had the chance to try, but I have sshd set up and running now, so if it happens again I'll be able to check.
It happened again, and I was unable to connect via SSH. I did test the SSH connection before the crash, and it worked fine, then after the crash it refused to connect, claiming no route to host. the last 2-3 seconds of audio were looping as well.
These are the only logs close to the time it happened, and actually occurred merely 1 or 2 seconds before the crash:
Jan 08 10:22:01 pop-os org.chromium.Chromium.desktop[9376]: Fontconfig error: Cannot load default config file: No such file: (null)
Jan 08 10:22:03 pop-os org.chromium.Chromium.desktop[8163]: [1290:2311:0108/102203.814166:ERROR:vaapi_video_decoder.cc(1206)] failed Initialize()ing the frame pool
Jan 08 10:22:03 pop-os org.chromium.Chromium.desktop[8163]: [1290:1290:0108/102203.821010:ERROR:shared_image_factory.cc(575)] Could not find SharedImageBackingFactory with pa>
Jan 08 10:22:03 pop-os org.chromium.Chromium.desktop[8163]: [1290:1290:0108/102203.821214:ERROR:shared_image_factory.cc(575)] Could not find SharedImageBackingFactory with pa>
Jan 08 10:22:03 pop-os org.chromium.Chromium.desktop[8163]: [1290:1290:0108/102203.823059:ERROR:shared_image_factory.cc(575)] Could not find SharedImageBackingFactory with pa>
Jan 08 10:22:03 pop-os org.chromium.Chromium.desktop[8163]: [1290:1290:0108/102203.823225:ERROR:shared_image_factory.cc(575)] Could not find SharedImageBackingFactory with pa>
Jan 08 10:22:03 pop-os org.chromium.Chromium.desktop[8163]: [1290:1290:0108/102203.824842:ERROR:shared_image_factory.cc(575)] Could not find SharedImageBackingFactory with pa>
Jan 08 10:22:03 pop-os org.chromium.Chromium.desktop[8163]: [1290:1290:0108/102203.824974:ERROR:shared_image_factory.cc(575)] Could not find SharedImageBackingFactory with pa>
Jan 08 10:22:03 pop-os org.chromium.Chromium.desktop[8163]: [1290:1290:0108/102203.828563:ERROR:shared_image_factory.cc(575)] Could not find SharedImageBackingFactory with pa>
Jan 08 10:22:03 pop-os org.chromium.Chromium.desktop[8163]: [1290:1290:0108/102203.828698:ERROR:shared_image_factory.cc(575)] Could not find SharedImageBackingFactory with pa>
Jan 08 10:22:03 pop-os org.chromium.Chromium.desktop[8163]: [1290:1290:0108/102203.830345:ERROR:shared_image_factory.cc(575)] Could not find SharedImageBackingFactory with pa>
Jan 08 10:22:03 pop-os org.chromium.Chromium.desktop[8163]: [1290:1290:0108/102203.830454:ERROR:shared_image_factory.cc(575)] Could not find SharedImageBackingFactory with pa>
I've been having FontConfig crashes too and I can't figure them out! Driving me nuts.
Distribution (run
cat /etc/os-release
):Related Application and/or Package Version (run
apt policy $PACKAGE NAME
):Issue/Bug Description: Pop shop seems to inexplicably cause a full system hang (with no access to other tty's or anything - fully unresponsive) seemingly at random.
Steps to reproduce (if you know): Unfortunately, it seems to be entirely random.
Expected behavior: For it not to freeze my entire system.
Other Notes:
My entire system hung up while I left it idle to go make a drink. Came back, noticed the system was frozen, and looked at the clock on my PC to see the time it froze at. =
The system froze at exactly 00:14:00.
Rebooted using the power button, and checked journalctl for all logs since 00:10
At exactly 00:13:51, with nothing else in the log before or after, it had these messages about pop-shop:
naturally, the only conclusion I can draw is that it was either a cosmic coincidence (get it?), or this was the cause of the indefinite hang.
Neofetch for additional info: