vladkinoman / just-my-notes

Just my awesome notes :sunglasses:
11 stars 1 forks source link

Unsolved errors on Pop!_OS 20.10 #6

Closed vladkinoman closed 2 years ago

vladkinoman commented 3 years ago

Output from inxi:

vladkinoman@pop-os:~$ inxi -Fxz
System:
  Kernel: 5.8.0-7630-generic x86_64 bits: 64 compiler: gcc 
  v: 10.2.0 Desktop: N/A Distro: Pop!_OS 20.10 
  base: Ubuntu 20.10 Groovy 
Machine:
  Type: Laptop System: Hewlett-Packard 
  product: HP ProBook 450 G0 v: A2018CD200 serial: <filter> 
  Mobo: Hewlett-Packard model: 1949 v: KBC Version 90.0F 
  serial: <filter> BIOS: Hewlett-Packard v: 68IRF Ver. F.70 
  date: 04/11/2019 
Battery:
  ID-1: BAT0 charge: 23.7 Wh condition: 23.7/23.7 Wh (100%) 
  model: Hewlett-Packard Primary status: Full 
CPU:
  Info: Dual Core model: Intel Core i5-3230M bits: 64 
  type: MT MCP arch: Ivy Bridge rev: 9 L2 cache: 3072 KiB 
  flags: avx lm nx pae sse sse2 sse3 sse4_1 sse4_2 ssse3 
  bogomips: 20753 
  Speed: 1698 MHz min/max: 1200/3200 MHz Core speeds (MHz): 
  1: 1749 2: 1629 3: 1642 4: 1658 
Graphics:
  Device-1: Intel 3rd Gen Core processor Graphics 
  vendor: Hewlett-Packard driver: i915 v: kernel 
  bus ID: 00:02.0 
  Device-2: AMD Mars [Radeon HD 8670A/8670M/8750M] 
  vendor: Hewlett-Packard driver: amdgpu v: kernel 
  bus ID: 01:00.0 
  Device-3: Cheng Uei Precision Industry (Foxlink) type: USB 
  driver: uvcvideo bus ID: 1-1.3:3 
  Display: x11 server: X.Org 1.20.9 
  driver: amdgpu,ati,modesetting unloaded: fbdev,vesa 
  resolution: 1366x768~60Hz 
  OpenGL: renderer: Mesa DRI Intel HD Graphics 4000 (IVB GT2) 
  v: 4.2 Mesa 20.2.6 direct render: Yes 
Audio:
  Device-1: Intel 7 Series/C216 Family High Definition Audio 
  vendor: Hewlett-Packard driver: snd_hda_intel v: kernel 
  bus ID: 00:1b.0 
  Sound Server: ALSA v: k5.8.0-7630-generic 
Network:
  Device-1: Ralink RT3290 Wireless 802.11n 1T/1R PCIe 
  vendor: Hewlett-Packard driver: rt2800pci v: 2.3.0 port: 3000 
  bus ID: 04:00.0 
  IF: wlo1 state: up mac: <filter> 
  Device-2: Realtek RTL8111/8168/8411 PCI Express Gigabit 
  Ethernet 
  vendor: Hewlett-Packard driver: r8169 v: kernel port: 2000 
  bus ID: 05:00.0 
  IF: enp5s0 state: down mac: <filter> 
Drives:
  Local Storage: total: 698.64 GiB used: 378.25 GiB (54.1%) 
  ID-1: /dev/sda vendor: Western Digital 
  model: WD7500BPVT-60HXZT3 size: 698.64 GiB temp: 33 C 
Partition:
  ID-1: / size: 65.60 GiB used: 11.16 GiB (17.0%) fs: ext4 
  dev: /dev/sda6 
  ID-2: /home size: 99.90 GiB used: 63.59 GiB (63.7%) fs: ext4 
  dev: /dev/sda5 
Swap:
  Alert: No Swap data was found. 
Sensors:
  System Temperatures: cpu: 57.0 C mobo: 0.0 C gpu: amdgpu 
  temp: 48.0 C 
  Fan Speeds (RPM): N/A 
Info:
  Processes: 255 Uptime: 19m Memory: 7.63 GiB 
  used: 2.68 GiB (35.1%) Init: systemd runlevel: 5 Compilers: 
  gcc: 10.2.0 Packages: 2721 Shell: Bash v: 5.0.17 inxi: 3.1.07

What I solved already:

  1. PulseAudio issue. Described here.
  2. AmdGpu issue. Described here and here.
  3. Journalctl issue. I couldn't read the journal without the sudo command. So, I used the usermod command to add myself to adm group. Described in this discussion.

New issues:

vladkinoman@pop-os:~$ journalctl -xb -p err
-- Logs begin at Fri 2021-01-08 15:23:33 EET, end at Sat 2021-01-09 18:17:35 EET. --
Jan 09 16:13:26 pop-os kernel: x86/cpu: VMX (outside TXT) disabled by BIOS
Jan 09 16:13:26 pop-os kernel: kfd kfd: OLAND  not supported in kfd
Jan 09 16:14:31 pop-os gdm-password][1328]: gkr-pam: unable to locate daemon control file
Jan 09 16:14:46 pop-os systemd[1333]: Failed to start Application launched by gnome-session-binary.
░░ Subject: A start job for unit UNIT has failed
░░ Defined-By: systemd
░░ Support: http://www.ubuntu.com/support
░░ 
░░ A start job for unit UNIT has finished with a failure.
░░ 
░░ The job identifier is 416 and the job result is failed.

Plus, I think there is an issue with the Wi-Fi driver. I lose connection every two hours. To fix such an error, you have to turn the Wi-Fi button off and on on the laptop. There was something similar on Linux Mint. However, it disappeared either when I switched to Cinnamon or after some update.

And when I launch one of the The West Wing episodes using MPV, I get the following error:

 (+) Subs  --sid=1 'The West Wing S02E01.srt' (subrip) (external)
Error decoding audio.
AO: [pulse] 48000Hz stereo 2ch float

A possible solution to this problem could be found here. However, I think it is a problem with the files themselves.

vladkinoman commented 3 years ago

A possible solution to the x86/cpu: VMX (outside TXT) disabled by BIOS problem: link. In short, go to your BIOS settings and enable the virtualization technology. Acording to Reddit these are the benefits of virtualization:

You would need to be employing AMD's enterprise-level virtualization encryption to be concerned with potential door opening for Spectre. It really doesn't make sense for it to be disabled by default on desktop systems - it just makes people think that AMD can't do it considering every other AMD and Intel system has always left it enabled by default.

It has me wondering at this point if there were just early issues with virtualization on the AM4 platform and that caused everyone to disable it by default and they never bothered turning it back on.

Maybe, it's the type of thing people might be hesitant to change if they don't know the reasoning behind the original change.

vladkinoman commented 3 years ago
Jan 09 16:13:26 pop-os kernel: kfd kfd: OLAND  not supported in kfd

Hmm :thinking: This is a very difficult problem because it is unclear why it is being displayed in the log file. What is OLAND? OLAND is a chip variant of my GPU according to this. I don't know what is kfd, but I know that it has something to do with ROCm - Open Source Platform for HPC and Ultrascale GPU Computing.

ROCm is focused on using AMD GPUs to accelerate computational tasks such as machine learning, engineering workloads, and scientific computing. In order to focus development efforts on these domains of interest, ROCm supports a targeted set of hardware configurations which are detailed on that GitHub page.

ROCm is a collection of software ranging from drivers and runtimes to libraries and developer tools. Some of this software may work with more GPUs than the "officially supported" list mentioned above, though AMD does not make any official claims of support for these devices on the ROCm software platform.

Unfortunately ROCm does not support AMD GPUs that use OLAND chips.

By the way, ROCmSupport says that Pop!_OS is not officially supported with ROCm. There is also an interesting tip in the same discussion:

The device /dev/kfd need video or render group, using ls -l /dev/kfd to check the device group, and using groups to check the current login user group, if not matched, do sudo usermod -a -G video $LOGNAME

could this launch be failing because the owner of the file /dev/kfd is root and not the actual user on OS.

Using usermod I have added myself to the video group and even root. Didn't help.

My conclusion is that the video card just doesn't support the ROCm technology which is supported in the new kernel version (probably).

vladkinoman commented 3 years ago
Jan 09 16:14:31 pop-os gdm-password][1328]: gkr-pam: unable to locate daemon control file

Looks like it hasn't been fixed. People are still complaining about the problem here. Nevertheless, they recommend adding XDG_RUNTIME_DIR=/run/user/$UID to /etc/profile. However, it didn't help me.

Also, this post recommends setting XDG_BIN_HOME in .profile.

According to this thread (and information on the discussion of the issue):

There is no issue. Gdm-password checks whether a file is set via an environment variable that is under normal circumstances in a systemd context not available at the time it first does the check. It will note that, and retry later when the session is there and PAM/systemd has set up the environments for your user. It's an informal "error" most likely only relevant for systems that do not use systemd/logind, are embedded and single-user and start to actively fail when the file that is being checked for here is not present.

That's probably true:

vladkinoman@pop-os:~$ journalctl | grep 'gkr-pam'
...
Jan 09 23:07:05 pop-os gdm-password][1342]: gkr-pam: unable to locate daemon control file
Jan 09 23:07:05 pop-os gdm-password][1342]: gkr-pam: stashed password to try later in open session
Jan 09 23:07:06 pop-os gdm-password][1342]: gkr-pam: gnome-keyring-daemon started properly and unlocked keyring

I think :thinking:

vladkinoman commented 3 years ago
Jan 09 16:14:46 pop-os systemd[1333]: Failed to start Application launched by gnome-session-binary.
░░ Subject: A start job for unit UNIT has failed
░░ Defined-By: systemd
░░ Support: http://www.ubuntu.com/support
░░ 
░░ A start job for unit UNIT has finished with a failure.
░░ 
░░ The job identifier is 416 and the job result is failed.

This is a very fresh issue. I haven't found a solution, but there are plenty of threads to keep track of: ArchLinux forum, AskUbuntu, Fedora - same question as AskUbuntu, Fedora - Failed to start Application launched by gnome-session-binary.

vladkinoman commented 3 years ago

So, I've only fixed the virtualization issue:

vladkinoman@pop-os:~$ journalctl -xb -p err
-- Logs begin at Fri 2021-01-08 15:23:33 EET, end at Sat 2021-01-09 23:21:38 EET. --
Jan 09 23:06:05 pop-os kernel: kfd kfd: OLAND  not supported in kfd
Jan 09 23:07:05 pop-os gdm-password][1342]: gkr-pam: unable to locate daemon control file
Jan 09 23:07:16 pop-os systemd[1348]: Failed to start Application launched by gnome-session-binary.
░░ Subject: A start job for unit UNIT has failed
░░ Defined-By: systemd
░░ Support: http://www.ubuntu.com/support
░░ 
░░ A start job for unit UNIT has finished with a failure.
░░ 
░░ The job identifier is 312 and the job result is failed.
Jan 09 23:07:23 pop-os systemd[1348]: Failed to start Application launched by gnome-session-binary.
░░ Subject: A start job for unit UNIT has failed
░░ Defined-By: systemd
░░ Support: http://www.ubuntu.com/support
░░ 
░░ A start job for unit UNIT has finished with a failure.
░░ 
░░ The job identifier is 419 and the job result is failed.
vladkinoman commented 3 years ago

I'm getting strange new errors every 10 minutes or so:

Jan 10 19:34:04 pop-os kernel: blk_update_request: I/O error, dev sr0, sector 0 op 0x0:(READ) flags 0x80700 phys_seg 3 prio class 0
Jan 10 19:34:04 pop-os kernel: blk_update_request: I/O error, dev sr0, sector 0 op 0x0:(READ) flags 0x0 phys_seg 1 prio class 0
Jan 10 19:34:04 pop-os kernel: Buffer I/O error on dev sr0, logical block 0, async page read
Jan 10 19:34:04 pop-os kernel: blk_update_request: I/O error, dev sr0, sector 1 op 0x0:(READ) flags 0x0 phys_seg 1 prio class 0
Jan 10 19:34:04 pop-os kernel: Buffer I/O error on dev sr0, logical block 1, async page read
Jan 10 19:34:04 pop-os kernel: blk_update_request: I/O error, dev sr0, sector 2 op 0x0:(READ) flags 0x0 phys_seg 1 prio class 0
Jan 10 19:34:04 pop-os kernel: Buffer I/O error on dev sr0, logical block 2, async page read
Jan 10 19:34:04 pop-os kernel: blk_update_request: I/O error, dev sr0, sector 3 op 0x0:(READ) flags 0x0 phys_seg 1 prio class 0
Jan 10 19:34:04 pop-os kernel: Buffer I/O error on dev sr0, logical block 3, async page read
Jan 10 19:34:04 pop-os kernel: blk_update_request: I/O error, dev sr0, sector 4 op 0x0:(READ) flags 0x0 phys_seg 1 prio class 0
Jan 10 19:34:04 pop-os kernel: Buffer I/O error on dev sr0, logical block 4, async page read
Jan 10 19:34:04 pop-os kernel: blk_update_request: I/O error, dev sr0, sector 5 op 0x0:(READ) flags 0x0 phys_seg 1 prio class 0
Jan 10 19:34:04 pop-os kernel: Buffer I/O error on dev sr0, logical block 5, async page read
Jan 10 19:34:04 pop-os kernel: blk_update_request: I/O error, dev sr0, sector 6 op 0x0:(READ) flags 0x0 phys_seg 1 prio class 0
Jan 10 19:34:04 pop-os kernel: Buffer I/O error on dev sr0, logical block 6, async page read
Jan 10 19:34:04 pop-os kernel: blk_update_request: I/O error, dev sr0, sector 7 op 0x0:(READ) flags 0x0 phys_seg 1 prio class 0
Jan 10 19:34:04 pop-os kernel: Buffer I/O error on dev sr0, logical block 7, async page read

I work with Lutris at times like this.

vladkinoman commented 3 years ago

Plus, I think there is an issue with the Wi-Fi driver. I lose connection every two hours. To fix such an error, you have to turn the Wi-Fi button off and on on the laptop. There was something similar on Linux Mint. However, it disappeared either when I switched to Cinnamon or after some update.

I figured out what the problem was. Solved it here.

vladkinoman commented 2 years ago

Distrohopped to Fedora 35.