sched-ext / scx

sched_ext schedulers and tools
https://bit.ly/scx_slack
GNU General Public License v2.0
838 stars 78 forks source link

Kernel OOPS under high memory pressure #370

Open SoulHarsh007 opened 3 months ago

SoulHarsh007 commented 3 months ago

The system was under high memory pressure, and this is when this kernel oops was triggered, I am not sure if high memory pressure was the root cause here.

Full Log: https://paste.cachyos.org/p/4af5d85

Relevant Trace:

Jun 18 20:37:40 SoulHarsh007 kernel: ------------[ cut here ]------------
Jun 18 20:37:40 SoulHarsh007 kernel: WARNING: CPU: 7 PID: 1 at kernel/sched/ext.c:3725 scx_cgroup_can_attach+0x196/0x340
Jun 18 20:37:40 SoulHarsh007 kernel: Modules linked in: xt_nat xt_tcpudp veth xt_conntrack nft_chain_nat xt_MASQUERADE nf_nat nf_conntrack_netlink nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 xfrm_user xfrm_algo xt_addrtype nft_compat nf_tables br_netfilter bridge stp llc overlay rfcomm snd_seq_dummy snd_seq_midi snd_hrtimer snd_seq_midi_event snd_seq cmac algif_hash algif_skcipher af_alg bnep vmnet(OE) nct6683 iwlmvm mac80211 libarc4 ptp pps_core uvcvideo videobuf2_vmalloc uvc videobuf2_memops snd_usb_audio videobuf2_v4l2 snd_usbmidi_lib iwlwifi videodev snd_ump snd_rawmidi videobuf2_common snd_seq_device r8169 mc realtek mdio_devres cfg80211 libphy btusb btrtl btintel btbcm btmtk bluetooth rfkill crc16 mousedev joydev hid_generic snd_hda_codec_realtek snd_hda_codec_generic snd_hda_scodec_component snd_hda_codec_hdmi amd_atl intel_rapl_msr intel_rapl_common vfat snd_hda_intel fat snd_intel_dspcfg snd_intel_sdw_acpi snd_hda_codec snd_hda_core kvm_amd snd_hwdep razermouse(OE) usbhid snd_pcm snd_timer kvm snd gpio_amdpt soundcore wmi_bmof
Jun 18 20:37:40 SoulHarsh007 kernel:  i2c_piix4 pcspkr gpio_generic rapl mac_hid lz4 lz4_compress dm_mod loop nfnetlink zram ip_tables x_tables crct10dif_pclmul crc32_pclmul polyval_clmulni polyval_generic ghash_clmulni_intel sha512_ssse3 sha256_ssse3 sha1_ssse3 nvme aesni_intel gf128mul nvme_core crypto_simd cryptd ccp xhci_pci nvme_auth zenpower(OE) xhci_pci_renesas amdgpu video wmi amdxcp i2c_algo_bit drm_ttm_helper ttm drm_exec gpu_sched drm_suballoc_helper drm_buddy drm_display_helper cec btrfs blake2b_generic libcrc32c crc32c_generic crc32c_intel xor raid6_pq winesync(OE) vmmon(OE) vmw_vmci vboxnetflt(OE) vboxnetadp(OE) vboxdrv(OE) i2c_dev sg crypto_user
Jun 18 20:37:40 SoulHarsh007 kernel: CPU: 7 PID: 1 Comm: systemd Tainted: G S         OE      6.10.0-rc4-1-cachyos-rc #1 5c580bd5f712751ef3d9c6e2d14b3ccfe6c31623
Jun 18 20:37:40 SoulHarsh007 kernel: Hardware name: Micro-Star International Co., Ltd. MS-7C91/MAG B550 TOMAHAWK (MS-7C91), BIOS A.G0 03/12/2024
Jun 18 20:37:40 SoulHarsh007 kernel: Sched_ext: rustland (enabled+all), task: runnable_at=-3ms
Jun 18 20:37:40 SoulHarsh007 kernel: RIP: 0010:scx_cgroup_can_attach+0x196/0x340
Jun 18 20:37:40 SoulHarsh007 kernel: Code: ff 48 8b 04 24 48 c7 c5 80 f0 8d a8 48 85 c0 0f 85 50 ff ff ff 48 83 bb a8 03 00 00 00 48 c7 c0 80 f0 8d a8 0f 84 5a ff ff ff <0f> 0b e9 53 ff ff ff 80 3d cf 8b 27 02 00 0f 85 5d ff ff ff ba 01
Jun 18 20:37:40 SoulHarsh007 kernel: RSP: 0018:ffffad554006fa30 EFLAGS: 00010286
Jun 18 20:37:40 SoulHarsh007 kernel: RAX: ffff958640fdc000 RBX: ffff9586fb3da600 RCX: 0000000000000001
Jun 18 20:37:40 SoulHarsh007 kernel: RDX: ffffffffa88df080 RSI: ffffad554006fa30 RDI: ffffad554006fb20
Jun 18 20:37:40 SoulHarsh007 kernel: RBP: ffffffffa88df080 R08: ffff958ac5e8f400 R09: ffffad554006fb20
Jun 18 20:37:40 SoulHarsh007 kernel: R10: 0000000000000001 R11: 0000000000002000 R12: ffff958640893900
Jun 18 20:37:40 SoulHarsh007 kernel: R13: ffffad554006fb20 R14: ffffad554006fa30 R15: ffff9586fb3da600
Jun 18 20:37:40 SoulHarsh007 kernel: FS:  00007c6fca2af880(0000) GS:ffff958c41d80000(0000) knlGS:0000000000000000
Jun 18 20:37:40 SoulHarsh007 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jun 18 20:37:40 SoulHarsh007 kernel: CR2: 000061945cd3b000 CR3: 0000000307fbc000 CR4: 0000000000f506f0
Jun 18 20:37:40 SoulHarsh007 kernel: PKRU: 55555554
Jun 18 20:37:40 SoulHarsh007 kernel: Call Trace:
Jun 18 20:37:40 SoulHarsh007 kernel:  <TASK>
Jun 18 20:37:40 SoulHarsh007 kernel:  ? scx_cgroup_can_attach+0x196/0x340
Jun 18 20:37:40 SoulHarsh007 kernel:  ? __warn.cold+0x8e/0xf3
Jun 18 20:37:40 SoulHarsh007 kernel:  ? scx_cgroup_can_attach+0x196/0x340
Jun 18 20:37:40 SoulHarsh007 kernel:  ? report_bug+0xe7/0x200
Jun 18 20:37:40 SoulHarsh007 kernel:  ? handle_bug+0x3c/0x80
Jun 18 20:37:40 SoulHarsh007 kernel:  ? exc_invalid_op+0x19/0xc0
Jun 18 20:37:40 SoulHarsh007 kernel:  ? asm_exc_invalid_op+0x1a/0x20
Jun 18 20:37:40 SoulHarsh007 kernel:  ? scx_cgroup_can_attach+0x196/0x340
Jun 18 20:37:40 SoulHarsh007 kernel:  cgroup_migrate_execute+0x5b1/0x700
Jun 18 20:37:40 SoulHarsh007 kernel:  ? cgroup_migrate+0x15f/0x360
Jun 18 20:37:40 SoulHarsh007 kernel:  cgroup_attach_task+0x296/0x400
Jun 18 20:37:40 SoulHarsh007 kernel:  ? cgroup_attach_permissions+0x8b/0x230
Jun 18 20:37:40 SoulHarsh007 kernel:  __cgroup_procs_write+0x128/0x140
Jun 18 20:37:40 SoulHarsh007 kernel:  cgroup_procs_write+0x17/0x30
Jun 18 20:37:40 SoulHarsh007 kernel:  kernfs_fop_write_iter+0x141/0x1f0
Jun 18 20:37:40 SoulHarsh007 kernel:  vfs_write+0x31d/0x4a0
Jun 18 20:37:40 SoulHarsh007 kernel:  __x64_sys_write+0x72/0xf0
Jun 18 20:37:40 SoulHarsh007 kernel:  do_syscall_64+0x82/0x160
Jun 18 20:37:40 SoulHarsh007 kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
Jun 18 20:37:40 SoulHarsh007 kernel:  ? __x64_sys_openat+0x1f5/0x230
Jun 18 20:37:40 SoulHarsh007 kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
Jun 18 20:37:40 SoulHarsh007 kernel:  ? syscall_exit_to_user_mode+0x76/0x1f0
Jun 18 20:37:40 SoulHarsh007 kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
Jun 18 20:37:40 SoulHarsh007 kernel:  ? do_syscall_64+0x8e/0x160
Jun 18 20:37:40 SoulHarsh007 kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
Jun 18 20:37:40 SoulHarsh007 kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
Jun 18 20:37:40 SoulHarsh007 kernel:  ? syscall_exit_to_user_mode+0x76/0x1f0
Jun 18 20:37:40 SoulHarsh007 kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
Jun 18 20:37:40 SoulHarsh007 kernel:  ? do_syscall_64+0x8e/0x160
Jun 18 20:37:40 SoulHarsh007 kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
Jun 18 20:37:40 SoulHarsh007 kernel:  ? syscall_exit_to_user_mode+0x76/0x1f0
Jun 18 20:37:40 SoulHarsh007 kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
Jun 18 20:37:40 SoulHarsh007 kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
Jun 18 20:37:40 SoulHarsh007 kernel:  ? __x64_sys_fcntl+0x98/0xd0
Jun 18 20:37:40 SoulHarsh007 kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
Jun 18 20:37:40 SoulHarsh007 kernel:  ? syscall_exit_to_user_mode+0x76/0x1f0
Jun 18 20:37:40 SoulHarsh007 kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
Jun 18 20:37:40 SoulHarsh007 kernel:  ? do_syscall_64+0x8e/0x160
Jun 18 20:37:40 SoulHarsh007 kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
Jun 18 20:37:40 SoulHarsh007 kernel:  ? do_syscall_64+0x8e/0x160
Jun 18 20:37:40 SoulHarsh007 kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
Jun 18 20:37:40 SoulHarsh007 kernel:  ? syscall_exit_to_user_mode+0x76/0x1f0
Jun 18 20:37:40 SoulHarsh007 kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
Jun 18 20:37:40 SoulHarsh007 kernel:  ? do_syscall_64+0x8e/0x160
Jun 18 20:37:40 SoulHarsh007 kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
Jun 18 20:37:40 SoulHarsh007 kernel:  ? do_syscall_64+0x8e/0x160
Jun 18 20:37:40 SoulHarsh007 kernel:  ? irq_exit_rcu+0x53/0xc0
Jun 18 20:37:40 SoulHarsh007 kernel:  ? srso_alias_return_thunk+0x5/0xfbef5
Jun 18 20:37:40 SoulHarsh007 kernel:  entry_SYSCALL_64_after_hwframe+0x76/0x7e
Jun 18 20:37:40 SoulHarsh007 kernel: RIP: 0033:0x7c6fc9d175a4
Jun 18 20:37:40 SoulHarsh007 kernel: Code: c7 00 16 00 00 00 b8 ff ff ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 f3 0f 1e fa 80 3d a5 6a 0e 00 00 74 13 b8 01 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 54 c3 0f 1f 00 48 83 ec 28 48 89 54 24 18 48
Jun 18 20:37:40 SoulHarsh007 kernel: RSP: 002b:00007ffdf24f8a48 EFLAGS: 00000202 ORIG_RAX: 0000000000000001
Jun 18 20:37:40 SoulHarsh007 kernel: RAX: ffffffffffffffda RBX: 0000000000000005 RCX: 00007c6fc9d175a4
Jun 18 20:37:40 SoulHarsh007 kernel: RDX: 0000000000000005 RSI: 00007ffdf24f8c3a RDI: 0000000000000008
Jun 18 20:37:40 SoulHarsh007 kernel: RBP: 00007ffdf24f8c3a R08: 0000000000000005 R09: 0000000000000000
Jun 18 20:37:40 SoulHarsh007 kernel: R10: 0000000000000000 R11: 0000000000000202 R12: 0000000000000005
Jun 18 20:37:40 SoulHarsh007 kernel: R13: 000061945cd36b10 R14: 00007ffdf24f8c3a R15: 0000000000000005
Jun 18 20:37:40 SoulHarsh007 kernel:  </TASK>
Jun 18 20:37:40 SoulHarsh007 kernel: ---[ end trace 0000000000000000 ]---