Open SQalliT opened 3 weeks ago
This should be fixed in scx-scheds-git. Could you please retry, when installing the scx-scheds-git package?
Ive installed the scx-scheds-git package and still get the issue. Please see below for the most recent journalctl showing boot and plasma ksplash failure:
Jun 14 13:57:54 ra kernel: sched_ext: BPF scheduler "rusty" errored, disabling
Jun 14 13:57:54 ra kernel: sched_ext: runnable task stall (ksplashqml[1290] failed to run for 13.937s)
Jun 14 13:57:54 ra kernel: scx_watchdog_workfn+0x154/0x1e0
Jun 14 13:57:54 ra kernel: process_one_work+0x18e/0x350
Jun 14 13:57:54 ra kernel: worker_thread+0x2fa/0x490
Jun 14 13:57:54 ra kernel: kthread+0xd2/0x100
Jun 14 13:57:54 ra kernel: ret_from_fork+0x34/0x50
Jun 14 13:57:54 ra kernel: ret_from_fork_asm+0x1a/0x30
Jun 14 13:57:54 ra systemd[1]: scx.service: Main process exited, code=exited, status=1/FAILURE
Jun 14 13:57:54 ra systemd[1]: scx.service: Failed with result 'exit-code'.
Jun 14 13:57:54 ra systemd[1]: scx.service: Scheduled restart job, restart counter is at 1.
Jun 14 13:57:54 ra systemd[1]: Started Start scx_scheduler.
Jun 14 13:57:56 ra kernel: usb 1-1: USB disconnect, device number 2
Jun 14 13:58:04 ra kernel: rcu_tasks_wait_gp: rcu_tasks grace period number 41 (since boot) is 10088 jiffies old.
Jun 14 13:58:05 ra xdg-desktop-por[1294]: Failed to create settings proxy: Error calling StartServiceByName for org.freedesktop.impl.portal.desktop.kde: Timeout was reached
Jun 14 13:58:05 ra xdg-desktop-por[1294]: No skeleton to export
Jun 14 13:58:05 ra kernel: sched_ext: BPF scheduler "rusty" errored, disabling
Jun 14 13:58:05 ra kernel: sched_ext: runnable task stall (kworker/31:1[311] failed to run for 10.172s)
Jun 14 13:58:05 ra kernel: scx_watchdog_workfn+0x154/0x1e0
Jun 14 13:58:05 ra kernel: process_one_work+0x18e/0x350
Jun 14 13:58:05 ra kernel: worker_thread+0x2fa/0x490
Jun 14 13:58:05 ra kernel: kthread+0xd2/0x100
Jun 14 13:58:05 ra kernel: ret_from_fork+0x34/0x50
Jun 14 13:58:05 ra kernel: ret_from_fork_asm+0x1a/0x30
Jun 14 13:58:05 ra systemd[1]: scx.service: Main process exited, code=exited, status=1/FAILURE
Jun 14 13:58:05 ra systemd[1]: scx.service: Failed with result 'exit-code'.
Jun 14 13:58:05 ra systemd[1]: scx.service: Scheduled restart job, restart counter is at 2.
Jun 14 13:58:05 ra systemd[1]: Started Start scx_scheduler.
Jun 14 13:58:10 ra systemd[1]: systemd-localed.service: Deactivated successfully.
Jun 14 13:58:16 ra kernel: sched_ext: BPF scheduler "rusty" errored, disabling
Jun 14 13:58:16 ra kernel: sched_ext: runnable task stall (kworker/31:1[311] failed to run for 10.179s)
Jun 14 13:58:16 ra kernel: scx_watchdog_workfn+0x154/0x1e0
Jun 14 13:58:16 ra kernel: process_one_work+0x18e/0x350
Jun 14 13:58:16 ra kernel: worker_thread+0x2fa/0x490
Jun 14 13:58:16 ra kernel: kthread+0xd2/0x100
Jun 14 13:58:16 ra kernel: ret_from_fork+0x34/0x50
Jun 14 13:58:16 ra kernel: ret_from_fork_asm+0x1a/0x30
Jun 14 13:58:16 ra systemd[1]: scx.service: Main process exited, code=exited, status=1/FAILURE
Jun 14 13:58:05 ra kernel: sched_ext: BPF scheduler "rusty" errored, disabling
Jun 14 13:58:05 ra kernel: sched_ext: runnable task stall (kworker/31:1[311] failed to run for 10.172s)
Jun 14 13:58:05 ra kernel: scx_watchdog_workfn+0x154/0x1e0
Jun 14 13:58:05 ra kernel: process_one_work+0x18e/0x350
Jun 14 13:58:05 ra kernel: worker_thread+0x2fa/0x490
Jun 14 13:58:05 ra kernel: kthread+0xd2/0x100
Jun 14 13:58:05 ra kernel: ret_from_fork+0x34/0x50
Jun 14 13:58:05 ra kernel: ret_from_fork_asm+0x1a/0x30
Jun 14 13:58:05 ra systemd[1]: scx.service: Main process exited, code=exited, status=1/FAILURE
Jun 14 13:58:05 ra systemd[1]: scx.service: Failed with result 'exit-code'.
Jun 14 13:58:05 ra systemd[1]: scx.service: Scheduled restart job, restart counter is at 2.
Jun 14 13:58:05 ra systemd[1]: Started Start scx_scheduler.
Jun 14 13:58:10 ra systemd[1]: systemd-localed.service: Deactivated successfully.
Jun 14 13:58:16 ra kernel: sched_ext: BPF scheduler "rusty" errored, disabling
Jun 14 13:58:16 ra kernel: sched_ext: runnable task stall (kworker/31:1[311] failed to run for 10.179s)
Jun 14 13:58:16 ra kernel: scx_watchdog_workfn+0x154/0x1e0
Jun 14 13:58:16 ra kernel: process_one_work+0x18e/0x350
Jun 14 13:58:16 ra kernel: worker_thread+0x2fa/0x490
Jun 14 13:58:16 ra kernel: kthread+0xd2/0x100
Jun 14 13:58:16 ra kernel: ret_from_fork+0x34/0x50
Jun 14 13:58:16 ra kernel: ret_from_fork_asm+0x1a/0x30
Jun 14 13:58:16 ra systemd[1]: scx.service: Main process exited, code=exited, status=1/FAILURE
Jun 14 13:58:16 ra systemd[1]: scx.service: Failed with result 'exit-code'.
Jun 14 13:58:16 ra systemd[1]: scx.service: Scheduled restart job, restart counter is at 3.
Jun 14 13:58:16 ra systemd[1]: scx.service: Start request repeated too quickly.
Jun 14 13:58:16 ra systemd[1]: scx.service: Failed with result 'exit-code'.
Jun 14 13:58:16 ra systemd[1]: Failed to start Start scx_scheduler.
Jun 14 13:58:20 ra systemd[1244]: plasma-ksplash.service: start operation timed out. Terminating.
Jun 14 13:58:30 ra xdg-desktop-por[1294]: Failed to create file chooser proxy: Error calling StartServiceByName for org.freedesktop.impl.portal.desktop.kde: Timeout was reached
Jun 14 13:58:30 ra xdg-desktop-por[1294]: No skeleton to export
Jun 14 13:58:34 ra kernel: rcu_tasks_wait_gp: rcu_tasks grace period number 41 (since boot) is 40200 jiffies old.
Jun 14 13:58:37 ra kernel: usb 1-1: new full-speed USB device number 6 using xhci_hcd
Jun 14 13:58:37 ra kernel: usb 1-1: New USB device found, idVendor=706b, idProduct=0011, bcdDevice= 0.01
Jun 14 13:58:37 ra kernel: usb 1-1: New USB device strings: Mfr=1, Product=2, SerialNumber=0
Jun 14 13:58:37 ra kernel: usb 1-1: Product: ikki68 Aurora
Jun 14 13:58:37 ra kernel: usb 1-1: Manufacturer: wuque studio
Jun 14 13:58:37 ra kernel: input: wuque studio ikki68 Aurora as /devices/pci0000:00/0000:00:01.2/0000:02:00.0/0000:03:08.0/0000:07:00.1/usb1/1-1/1-1:1.0/0003:706B:0011.000B/input/input23
Jun 14 13:58:37 ra kernel: hid-generic 0003:706B:0011.000B: input,hidraw0: USB HID v1.11 Keyboard [wuque studio ikki68 Aurora] on usb-0000:07:00.1-1/input0
Jun 14 13:58:37 ra kernel: hid-generic 0003:706B:0011.000C: hiddev96,hidraw1: USB HID v1.11 Device [wuque studio ikki68 Aurora] on usb-0000:07:00.1-1/input1
Jun 14 13:58:37 ra kernel: input: wuque studio ikki68 Aurora Mouse as /devices/pci0000:00/0000:00:01.2/0000:02:00.0/0000:03:08.0/0000:07:00.1/usb1/1-1/1-1:1.2/0003:706B:0011.000D/input/input24
Jun 14 13:58:37 ra kernel: input: wuque studio ikki68 Aurora System Control as /devices/pci0000:00/0000:00:01.2/0000:02:00.0/0000:03:08.0/0000:07:00.1/usb1/1-1/1-1:1.2/0003:706B:0011.000D/input/i>
Jun 14 13:58:37 ra kernel: input: wuque studio ikki68 Aurora Consumer Control as /devices/pci0000:00/0000:00:01.2/0000:02:00.0/0000:03:08.0/0000:07:00.1/usb1/1-1/1-1:1.2/0003:706B:0011.000D/input>
Jun 14 13:58:37 ra kernel: input: wuque studio ikki68 Aurora Keyboard as /devices/pci0000:00/0000:00:01.2/0000:02:00.0/0000:03:08.0/0000:07:00.1/usb1/1-1/1-1:1.2/0003:706B:0011.000D/input/input27
Jun 14 13:58:37 ra kernel: hid-generic 0003:706B:0011.000D: input,hidraw3: USB HID v1.11 Mouse [wuque studio ikki68 Aurora] on usb-0000:07:00.1-1/input2
Jun 14 13:58:37 ra mtp-probe[1451]: checking bus 1, device 6: "/sys/devices/pci0000:00/0000:00:01.2/0000:02:00.0/0000:03:08.0/0000:07:00.1/usb1/1-1"
Jun 14 13:58:37 ra mtp-probe[1451]: bus: 1, device: 6 was not an MTP device
Jun 14 13:58:37 ra systemd-logind[1013]: Watching system buttons on /dev/input/event6 (wuque studio ikki68 Aurora Consumer Control)
Jun 14 13:58:37 ra systemd-logind[1013]: Watching system buttons on /dev/input/event2 (wuque studio ikki68 Aurora)
Jun 14 13:58:37 ra systemd-logind[1013]: Watching system buttons on /dev/input/event4 (wuque studio ikki68 Aurora System Control)
Jun 14 13:58:37 ra systemd-logind[1013]: Watching system buttons on /dev/input/event7 (wuque studio ikki68 Aurora Keyboard)
Jun 14 13:58:37 ra mtp-probe[1486]: checking bus 1, device 6: "/sys/devices/pci0000:00/0000:00:01.2/0000:02:00.0/0000:03:08.0/0000:07:00.1/usb1/1-1"
Jun 14 13:58:37 ra mtp-probe[1486]: bus: 1, device: 6 was not an MTP device
Jun 14 13:58:40 ra plasma_waitforname[1281]: org.kde.knotifications: WaitForName: Service was not registered within timeout
Jun 14 13:58:40 ra systemd[1244]: dbus-:1.2-org.kde.KSplash@0.service: Main process exited, code=exited, status=1/FAILURE
Jun 14 13:58:40 ra systemd[1244]: dbus-:1.2-org.kde.KSplash@0.service: Failed with result 'exit-code'.
Jun 14 13:58:40 ra systemd[1244]: plasma-ksplash.service: Main process exited, code=killed, status=15/TERM
Jun 14 13:58:40 ra systemd[1244]: plasma-ksplash.service: Failed with result 'timeout'.
Jun 14 13:58:40 ra systemd[1244]: Failed to start Splash screen shown during boot.
Thanks @SQalliT, I'll take a look
Wasn't able to reproduce this on 6.9.3-1 with gnome. Let me see if it happens on 6.9.4, and then also try it with kde.
@ptr1337 Unfortunately the issue still persists on scx-scheds-git
I have this same issue, but with scx_rustland. The black screen eventually clears itself however.
Additionally, I have general system lag on login when using the systemd service. Stopping the service and restarting it seems to fix the issue. I'm unsure if that's related to this issue, but it seems like it might be.
Could you verify if https://github.com/sched-ext/scx/pull/387 helps?
Could you verify if #387 helps?
Im not sure I follow how to do that, my scx.service is as follows:
[Unit]
Description=Start scx_scheduler
ConditionPathIsDirectory=/sys/kernel/sched_ext
StartLimitIntervalSec=30
StartLimitBurst=2
[Service]
Type=simple
EnvironmentFile=/etc/default/scx
ExecStart=/bin/bash -c 'exec ${SCX_SCHEDULER_OVERRIDE:-$SCX_SCHEDULER} ${SCX_FLAGS_OVERRIDE:-$SCX_FLAGS} '
Restart=on-failure
StandardError=journal
LogNamespace=sched-ext
[Install]
WantedBy=graphical.target
Could you verify if #387 helps?
Im not sure I follow how to do that, my scx.service is as follows:
[Unit]
Description=Start scx_scheduler
ConditionPathIsDirectory=/sys/kernel/sched_ext
StartLimitIntervalSec=30
StartLimitBurst=2
[Service]
Type=simple
EnvironmentFile=/etc/default/scx
ExecStart=/bin/bash -c 'exec ${SCX_SCHEDULER_OVERRIDE:-$SCX_SCHEDULER} ${SCX_FLAGS_OVERRIDE:-$SCX_FLAGS} '
Restart=on-failure
StandardError=journal
LogNamespace=sched-ext
[Install]
WantedBy=graphical.target
Just put following into the service:
[Unit]
Description=Start scx_scheduler
ConditionPathIsDirectory=/sys/kernel/sched_ext
StartLimitIntervalSec=30
StartLimitBurst=2
After=graphical.target
[Service]
Type=simple
EnvironmentFile=/etc/default/scx
ExecStart=/bin/bash -c 'exec ${SCX_SCHEDULER_OVERRIDE:-$SCX_SCHEDULER} ${SCX_FLAGS_OVERRIDE:-$SCX_FLAGS} '
Restart=on-failure
StandardError=journal
LogNamespace=sched-ext
[Install]
WantedBy=multi-user.target
That seems to have fixed the issue. Thanks for the help
Your're welcome
Sadly I may have jumped the gun, still getting issues:
Jun 25 16:27:29 ra kernel: sched_ext: BPF scheduler "rusty" errored, disabling
Jun 25 16:27:29 ra kernel: sched_ext: runnable task stall (Xwayland[1371] failed to run for 14.157s)
Jun 25 16:27:29 ra kernel: scx_watchdog_workfn+0x154/0x1e0
Jun 25 16:27:29 ra kernel: process_one_work+0x18e/0x350
Jun 25 16:27:29 ra kernel: worker_thread+0x2fa/0x490
Jun 25 16:27:29 ra kernel: kthread+0xd2/0x100
Jun 25 16:27:29 ra kernel: ret_from_fork+0x34/0x50
Jun 25 16:27:29 ra kernel: ret_from_fork_asm+0x1a/0x30
my scx.service is:
[Unit]
Description=Start scx_scheduler
ConditionPathIsDirectory=/sys/kernel/sched_ext
StartLimitIntervalSec=30
StartLimitBurst=2
After=graphical.target
[Service]
Type=simple
EnvironmentFile=/etc/default/scx
ExecStart=/bin/bash -c 'exec ${SCX_SCHEDULER_OVERRIDE:-$SCX_SCHEDULER} ${SCX_FLAGS_OVERRIDE:-$SCX_FLAGS} '
Restart=on-failure
StandardError=journal
LogNamespace=sched-ext
[Install]
WantedBy=multi-user.target
Could you add:
[Unit]
Description=Run scx schedulers
[Timer]
OnBootSec=30sec
Unit=scx.service
[Install]
WantedBy=multi-user.target
into /usr/lib/systemd/system/scx.timer (you've create this file) and run
sudo systemctl enable scx.timer
And then try to reboot the system?
Of course, first disable scx.service.
Still getting:
Jun 25 16:42:53 ra kernel: sched_ext: BPF scheduler "rusty" errored, disabling
Jun 25 16:42:53 ra kernel: sched_ext: runnable task stall (kworker/31:1[311] failed to run for 14.672s)
Jun 25 16:42:53 ra kernel: scx_watchdog_workfn+0x154/0x1e0
Jun 25 16:42:53 ra kernel: process_one_work+0x18e/0x350
Jun 25 16:42:53 ra kernel: worker_thread+0x2fa/0x490
Jun 25 16:42:53 ra kernel: kthread+0xd2/0x100
Jun 25 16:42:53 ra kernel: ret_from_fork+0x34/0x50
Jun 25 16:42:53 ra kernel: ret_from_fork_asm+0x1a/0x30
Jun 25 16:42:53 ra systemd[1]: scx.service: Main process exited, code=exited, status=1/FAILURE
Jun 25 16:42:53 ra systemd[1]: scx.service: Failed with result 'exit-code'.
Jun 25 16:42:54 ra systemd[1]: scx.service: Scheduled restart job, restart counter is at 1.
Jun 25 16:42:54 ra systemd[1]: Started Start scx_scheduler.
Jun 25 16:43:02 ra systemd[1]: systemd-hostnamed.service: Deactivated successfully.
Jun 25 16:43:03 ra xdg-desktop-por[1291]: Failed to create settings proxy: Error calling StartServiceByName for org.freedesktop.impl.portal.desktop.kde: Timeout was reached
Jun 25 16:43:03 ra xdg-desktop-por[1291]: No skeleton to export
Jun 25 16:43:03 ra kernel: rcu_tasks_wait_gp: rcu_tasks grace period number 69 (since boot) is 10086 jiffies old.
Jun 25 16:43:04 ra kernel: sched_ext: BPF scheduler "rusty" errored, disabling
Jun 25 16:43:04 ra kernel: sched_ext: runnable task stall (kworker/31:1[311] failed to run for 10.218s)
Jun 25 16:43:04 ra kernel: scx_watchdog_workfn+0x154/0x1e0
Jun 25 16:43:04 ra kernel: process_one_work+0x18e/0x350
Jun 25 16:43:04 ra kernel: worker_thread+0x2fa/0x490
Jun 25 16:43:04 ra kernel: kthread+0xd2/0x100
Jun 25 16:43:04 ra kernel: ret_from_fork+0x34/0x50
Jun 25 16:43:04 ra kernel: ret_from_fork_asm+0x1a/0x30
scx.service:
[Unit]
Description=Start scx_scheduler
ConditionPathIsDirectory=/sys/kernel/sched_ext
StartLimitIntervalSec=30
StartLimitBurst=2
After=graphical.target
[Service]
Type=simple
EnvironmentFile=/etc/default/scx
ExecStart=/bin/bash -c 'exec ${SCX_SCHEDULER_OVERRIDE:-$SCX_SCHEDULER} ${SCX_FLAGS_OVERRIDE:-$SCX_FLAGS} '
Restart=on-failure
StandardError=journal
LogNamespace=sched-ext
[Install]
WantedBy=multi-user.target
scx.timer:
[Unit]
Description=Run scx schedulers
[Timer]
OnBootSec=30sec
Unit=scx.service
[Install]
WantedBy=graphical.target
Systemctl status':
❯ systemctl status scx.service
● scx.service - Start scx_scheduler
Loaded: loaded (/usr/lib/systemd/system/scx.service; enabled; preset: disabled)
Active: active (running) since Tue 2024-06-25 16:43:04 BST; 1min 57s ago
Invocation: fb432f8e74424d8bb214d8c1437a3081
TriggeredBy: ● scx.timer
Main PID: 1416 (scx_rusty)
Tasks: 12 (limit: 38332)
Memory: 226.9M (peak: 237.7M)
CPU: 240ms
CGroup: /system.slice/scx.service
└─1416 scx_rusty
Jun 25 16:45:02 ra bash[1416]: 15:45:02 [INFO] dir= 1.24 dir_greedy= 0.00 dir_greedy_far= 0.00
Jun 25 16:45:02 ra bash[1416]: 15:45:02 [INFO] dsq=14.46 greedy_local= 0.12 greedy_xnuma= 0.00
Jun 25 16:45:02 ra bash[1416]: 15:45:02 [INFO] kick_greedy= 0.12 rep= 0.00
Jun 25 16:45:02 ra bash[1416]: 15:45:02 [INFO] dl_clamped= 3.96 dl_preset=10.63
Jun 25 16:45:02 ra bash[1416]: 15:45:02 [INFO] slice_length=20000us
Jun 25 16:45:02 ra bash[1416]: 15:45:02 [INFO] direct_greedy_cpumask=0b11111111111111111111111111111111
Jun 25 16:45:02 ra bash[1416]: 15:45:02 [INFO] kick_greedy_cpumask=0b11111111111111111111111111111111
Jun 25 16:45:02 ra bash[1416]: 15:45:02 [INFO] NODE[00] load=0.04 imbal=+0.00 load_delta=+0.00
Jun 25 16:45:02 ra bash[1416]: 15:45:02 [INFO] DOMAIN[00] load=0.02 imbal=+0.00 load_delta=+0.00
Jun 25 16:45:02 ra bash[1416]: 15:45:02 [INFO] DOMAIN[01] load=0.02 imbal=-0.00 load_delta=-0.00
~
❯ systemctl status scx.timer
● scx.timer - Run scx schedulers
Loaded: loaded (/usr/lib/systemd/system/scx.timer; enabled; preset: disabled)
Active: active (elapsed) since Tue 2024-06-25 16:42:32 BST; 2min 33s ago
Invocation: b88a4c8c2e8e4125bedaff148c4a3463
Trigger: n/a
Triggers: ● scx.service
Jun 25 16:42:32 ra systemd[1]: Started Run scx schedulers.
~
could you share following:
systemd-analyze
and which CPU you have?
❯ sudo systemd-analyze
Startup finished in 9.013s (firmware) + 4.127s (loader) + 3.621s (kernel) + 3.292s (userspace) = 20.055s
graphical.target reached after 3.292s in userspace.
CPU: AMD Ryzen 9 5950X
I have this same issue, but with scx_rustland. The black screen eventually clears itself however.
Additionally, I have general system lag on login when using the systemd service. Stopping the service and restarting it seems to fix the issue. I'm unsure if that's related to this issue, but it seems like it might be.
Does PR #392 make a difference?
After updating to the git of sched-ext and using the latest version of the .service file, I have no issues anymore. Everything works as expected.
After updating to the git of sched-ext and using the latest version of the .service file, I have no issues anymore. Everything works as expected.
@Daktyl198 So now the question: is it a matter of sched-ext fixes or systemd service.
Ive disabled and deleted my scx.timer and scx.service, updated to the newest sched-ext git and re-enabled scx.service, and rebooted. Sadly I am still getting the issue.
@SQalliT if you can reproduce the problem, can you please attach the output of journalctl -u scx.service -b -1
? Thanks.
I use CachyOS (Arch based) and have the scx service enabled, with the rusty scheduler specified. After logging in after first boot I get a black screen for several minutes that can only be skipped by switching to another tty and back. The issue is not present when using scx_simple for example.
CachyOS Linux x86_64 Kernel: 6.9.3-4-cachyos (issue persists on 6.10.rc3-1 too) DE: KDE Plasma 6.0.5 CPU: AMD Ryzen 9 5950X
Below is the excerpt from my most recent boot:
Below is an excerpt from a previous boot with more information:
This is one of my first issue submissions so please let me know if you need any more information or if Ive done anything wrong.