xlab-uiuc / memtis

Tiered memory management
0 stars 0 forks source link

Kernel error while running liblinear #2

Open yulistic opened 6 months ago

yulistic commented 6 months ago
[ 2705.483020] ================================================================================
[ 2705.483035] UBSAN: array-index-out-of-bounds in mm/huge_memory.c:2838:28
[ 2705.483041] index 256 is out of range for type 'long unsigned int [16]'
[ 2705.483045] CPU: 20 PID: 309 Comm: kswapd1 Tainted: G           OE     5.15.19-htmm #4
[ 2705.483049] Hardware name: Supermicro SYS-511E-WR/X13SEW-F, BIOS 1.0 09/26/2022
[ 2705.483050] Call Trace:
[ 2705.483054]  <TASK>
[ 2705.483072]  dump_stack_lvl+0x4a/0x5f
[ 2705.483080]  dump_stack+0x10/0x12
[ 2705.483083]  ubsan_epilogue+0x9/0x45
[ 2705.483086]  __ubsan_handle_out_of_bounds.cold+0x44/0x49
[ 2705.483089]  ? try_to_migrate+0xb9/0xd0
[ 2705.483106]  ? try_to_unmap_one+0xab0/0xab0
[ 2705.483110]  ? anon_vma_ctor+0x40/0x40
[ 2705.483113]  split_huge_page_to_list+0xbff/0xc40
[ 2705.483126]  ? __mem_cgroup_try_charge_swap+0x17b/0x210
[ 2705.483136]  shrink_page_list+0xdb3/0xf00
[ 2705.483146]  shrink_inactive_list+0x1ff/0x480
[ 2705.483150]  shrink_lruvec+0x439/0x720
[ 2705.483153]  shrink_node+0x2c6/0x760
[ 2705.483156]  balance_pgdat+0x36a/0x750
[ 2705.483159]  kswapd+0x202/0x3a0
[ 2705.483164]  ? wait_woken+0x60/0x60
[ 2705.483176]  ? balance_pgdat+0x750/0x750
[ 2705.483178]  kthread+0x127/0x150
[ 2705.483191]  ? set_kthread_struct+0x40/0x40
[ 2705.483195]  ret_from_fork+0x1f/0x30
[ 2705.483205]  </TASK>
[ 2705.483206] ================================================================================
[ 2705.483207] ================================================================================
[ 2705.483208] UBSAN: array-index-out-of-bounds in mm/huge_memory.c:2839:21
[ 2705.483210] index 256 is out of range for type 'long unsigned int [16]'
[ 2705.483212] CPU: 20 PID: 309 Comm: kswapd1 Tainted: G           OE     5.15.19-htmm #4
[ 2705.483214] Hardware name: Supermicro SYS-511E-WR/X13SEW-F, BIOS 1.0 09/26/2022
[ 2705.483215] Call Trace:
[ 2705.483215]  <TASK>
[ 2705.483216]  dump_stack_lvl+0x4a/0x5f
[ 2705.483219]  dump_stack+0x10/0x12
[ 2705.483221]  ubsan_epilogue+0x9/0x45
[ 2705.483224]  __ubsan_handle_out_of_bounds.cold+0x44/0x49
[ 2705.483226]  ? try_to_migrate+0xb9/0xd0
[ 2705.483229]  ? try_to_unmap_one+0xab0/0xab0
[ 2705.483232]  ? anon_vma_ctor+0x40/0x40
[ 2705.483234]  split_huge_page_to_list+0xc17/0xc40
[ 2705.483237]  ? __mem_cgroup_try_charge_swap+0x17b/0x210
[ 2705.483240]  shrink_page_list+0xdb3/0xf00
[ 2705.483243]  shrink_inactive_list+0x1ff/0x480
[ 2705.483246]  shrink_lruvec+0x439/0x720
[ 2705.483249]  shrink_node+0x2c6/0x760
[ 2705.483251]  balance_pgdat+0x36a/0x750
[ 2705.483255]  kswapd+0x202/0x3a0
[ 2705.483257]  ? wait_woken+0x60/0x60
[ 2705.483259]  ? balance_pgdat+0x750/0x750
[ 2705.483261]  kthread+0x127/0x150
[ 2705.483264]  ? set_kthread_struct+0x40/0x40
[ 2705.483267]  ret_from_fork+0x1f/0x30
[ 2705.483271]  </TASK>
[ 2705.483272] ================================================================================
[ 2705.483309] BUG: kernel NULL pointer dereference, address: 000000000000000c
[ 2705.483311] #PF: supervisor read access in kernel mode
[ 2705.483313] #PF: error_code(0x0000) - not-present page
[ 2705.483316] PGD 0 P4D 0
[ 2705.483322] Oops: 0000 [#1] SMP NOPTI
[ 2705.483325] CPU: 20 PID: 309 Comm: kswapd1 Tainted: G           OE     5.15.19-htmm #4
[ 2705.483327] Hardware name: Supermicro SYS-511E-WR/X13SEW-F, BIOS 1.0 09/26/2022
[ 2705.483328] RIP: 0010:remove_migration_pte+0x3d4/0x6e0
[ 2705.483332] Code: 80 48 2b 15 56 f8 38 01 48 01 ca 48 8b 35 3c f8 38 01 25 f8 0f 00 00 48 c1 ea 0c 48 c1 e2 06 48 03 44 16 18 0f 84 11 fd ff ff <66> 83 78 04 00 0f 85 06 fd ff ff 4c 89 ff ba 00 10 00 00 48 29 f7
[ 2705.483335] RSP: 0018:ffffc9000e63f850 EFLAGS: 00010202
[ 2705.483338] RAX: 0000000000000008 RBX: ffffea0082658040 RCX: ffff88a141b0f008
[ 2705.483340] RDX: 000000008306c3c0 RSI: ffffea0000000000 RDI: 000000000000001b
[ 2705.483341] RBP: ffffc9000e63f8c0 R08: ffffff8000000000 R09: 0000008000000000
[ 2705.483343] R10: 0400000000000080 R11: 0000000000000067 R12: 0000000000000001
[ 2705.483344] R13: ffff88810ab14d48 R14: ffffc9000e63f938 R15: ffffea0082658040
[ 2705.483346] FS:  0000000000000000(0000) GS:ffff889fffd00000(0000) knlGS:0000000000000000
[ 2705.483348] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 2705.483349] CR2: 000000000000000c CR3: 00000002141f8001 CR4: 0000000000770ee0
[ 2705.483351] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 2705.483352] DR3: 0000000000000000 DR6: 00000000fffe07f0 DR7: 0000000000000400
[ 2705.483354] PKRU: 55555554
[ 2705.483355] Call Trace:
[ 2705.483356]  <TASK>
[ 2705.483357]  rmap_walk_anon+0xe8/0x260
[ 2705.483361]  rmap_walk_locked+0x28/0x40
[ 2705.483365]  remove_migration_ptes+0x5e/0x80
[ 2705.483367]  ? trace_event_raw_event_mm_migrate_pages_start+0xb0/0xb0
[ 2705.483370]  remap_page+0x52/0x80
[ 2705.483373]  split_huge_page_to_list+0x8a9/0xc40
[ 2705.483377]  shrink_page_list+0xdb3/0xf00
[ 2705.483380]  shrink_inactive_list+0x1ff/0x480
[ 2705.483383]  shrink_lruvec+0x439/0x720
[ 2705.483387]  shrink_node+0x2c6/0x760
[ 2705.483390]  balance_pgdat+0x36a/0x750
[ 2705.483393]  kswapd+0x202/0x3a0
[ 2705.483395]  ? wait_woken+0x60/0x60
[ 2705.483398]  ? balance_pgdat+0x750/0x750
[ 2705.483400]  kthread+0x127/0x150
[ 2705.483403]  ? set_kthread_struct+0x40/0x40
[ 2705.483407]  ret_from_fork+0x1f/0x30
[ 2705.483411]  </TASK>
[ 2705.483412] Modules linked in: xt_CHECKSUM(E) xt_MASQUERADE(E) xt_conntrack(E) ipt_REJECT(E) nf_reject_ipv4(E) xt_tcpudp(E) nft_compat(E) nft_chain_nat(E) nf_nat(E) nf_conntrack(E) nf_defrag_ipv6(E) nf_defrag_ipv4(E) nft_counter(E) nf_tables(E) nfnetlink(E) bridge(E) stp(E) llc(E) rpcsec_gss_krb5(E) auth_rpcgss(E) nfsv4(E) nfs(E) lockd(E) grace(E) fscache(E) netfs(E) nvme_fabrics(E) socwatch2_15(OE) vtsspp(OE) sep5(OE) socperf3(OE) pax(OE) mlx5_ib(E) ib_uverbs(E) ib_core(E) intel_rapl_msr(E) intel_rapl_common(E) i10nm_edac(E) nfit(E) x86_pkg_temp_thermal(E) intel_powerclamp(E) coretemp(E) sunrpc(E) mlx5_core(E) binfmt_misc(E) kvm_intel(E) mlxfw(E) psample(E) kvm(E) nls_iso8859_1(E) tls(E) rapl(E) isst_if_mbox_pci(E) joydev(E) pci_hyperv_intf(E) isst_if_mmio(E) input_leds(E) isst_if_common(E) pmt_telemetry(E) pmt_crashlog(E) idxd(E) pmt_class(E) idxd_bus(E) mei_me(E) mei(E) ipmi_ssif(E) acpi_ipmi(E) ipmi_si(E) ipmi_devintf(E) ipmi_msghandler(E) acpi_power_meter(E) acpi_pad(E) mac_hid(E)
[ 2705.483501]  sch_fq_codel(E) msr(E) parport_pc(E) ppdev(E) lp(E) ramoops(E) parport(E) pstore_blk(E) reed_solomon(E) pstore_zone(E) efi_pstore(E) ip_tables(E) x_tables(E) autofs4(E) btrfs(E) blake2b_generic(E) zstd_compress(E) raid10(E) raid456(E) async_raid6_recov(E) async_memcpy(E) async_pq(E) async_xor(E) async_tx(E) xor(E) rndis_host(E) cdc_ether(E) usbnet(E) mii(E) raid6_pq(E) libcrc32c(E) raid1(E) raid0(E) multipath(E) linear(E) dm_mirror(E) dm_region_hash(E) dm_log(E) hid_generic(E) usbhid(E) hid(E) crct10dif_pclmul(E) crc32_pclmul(E) ghash_clmulni_intel(E) aesni_intel(E) crypto_simd(E) cryptd(E) intel_pmt(E) ast(E) drm_vram_helper(E) drm_ttm_helper(E) ttm(E) nvme(E) drm_kms_helper(E) syscopyarea(E) sysfillrect(E) sysimgblt(E) fb_sys_fops(E) cec(E) rc_core(E) i2c_i801(E) igb(E) ahci(E) i2c_smbus(E) nvme_core(E) drm(E) dca(E) libahci(E) i2c_ismt(E) i2c_algo_bit(E) xhci_pci(E) xhci_pci_renesas(E) wmi(E) pinctrl_emmitsburg(E)
[ 2705.483800] CR2: 000000000000000c
[ 2705.483806] ---[ end trace 49476d92555e0dc1 ]---
[ 2705.498266] RIP: 0010:remove_migration_pte+0x3d4/0x6e0
[ 2705.498273] Code: 80 48 2b 15 56 f8 38 01 48 01 ca 48 8b 35 3c f8 38 01 25 f8 0f 00 00 48 c1 ea 0c 48 c1 e2 06 48 03 44 16 18 0f 84 11 fd ff ff <66> 83 78 04 00 0f 85 06 fd ff ff 4c 89 ff ba 00 10 00 00 48 29 f7
[ 2705.498275] RSP: 0018:ffffc9000e63f850 EFLAGS: 00010202
[ 2705.498278] RAX: 0000000000000008 RBX: ffffea0082658040 RCX: ffff88a141b0f008
[ 2705.498280] RDX: 000000008306c3c0 RSI: ffffea0000000000 RDI: 000000000000001b
[ 2705.498281] RBP: ffffc9000e63f8c0 R08: ffffff8000000000 R09: 0000008000000000
[ 2705.498283] R10: 0400000000000080 R11: 0000000000000067 R12: 0000000000000001
[ 2705.498284] R13: ffff88810ab14d48 R14: ffffc9000e63f938 R15: ffffea0082658040
[ 2705.498286] FS:  0000000000000000(0000) GS:ffff889fffd00000(0000) knlGS:0000000000000000
[ 2705.498288] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 2705.498290] CR2: 000000000000000c CR3: 00000002141f8001 CR4: 0000000000770ee0
[ 2705.498292] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 2705.498293] DR3: 0000000000000000 DR6: 00000000fffe07f0 DR7: 0000000000000400
[ 2705.498294] PKRU: 55555554
[ 2705.498319] ------------[ cut here ]------------
[ 2705.498322] WARNING: CPU: 20 PID: 309 at kernel/exit.c:742 do_exit+0x49/0xad0
[ 2705.498344] Modules linked in: xt_CHECKSUM(E) xt_MASQUERADE(E) xt_conntrack(E) ipt_REJECT(E) nf_reject_ipv4(E) xt_tcpudp(E) nft_compat(E) nft_chain_nat(E) nf_nat(E) nf_conntrack(E) nf_defrag_ipv6(E) nf_defrag_ipv4(E) nft_counter(E) nf_tables(E) nfnetlink(E) bridge(E) stp(E) llc(E) rpcsec_gss_krb5(E) auth_rpcgss(E) nfsv4(E) nfs(E) lockd(E) grace(E) fscache(E) netfs(E) nvme_fabrics(E) socwatch2_15(OE) vtsspp(OE) sep5(OE) socperf3(OE) pax(OE) mlx5_ib(E) ib_uverbs(E) ib_core(E) intel_rapl_msr(E) intel_rapl_common(E) i10nm_edac(E) nfit(E) x86_pkg_temp_thermal(E) intel_powerclamp(E) coretemp(E) sunrpc(E) mlx5_core(E) binfmt_misc(E) kvm_intel(E) mlxfw(E) psample(E) kvm(E) nls_iso8859_1(E) tls(E) rapl(E) isst_if_mbox_pci(E) joydev(E) pci_hyperv_intf(E) isst_if_mmio(E) input_leds(E) isst_if_common(E) pmt_telemetry(E) pmt_crashlog(E) idxd(E) pmt_class(E) idxd_bus(E) mei_me(E) mei(E) ipmi_ssif(E) acpi_ipmi(E) ipmi_si(E) ipmi_devintf(E) ipmi_msghandler(E) acpi_power_meter(E) acpi_pad(E) mac_hid(E)
[ 2705.498389]  sch_fq_codel(E) msr(E) parport_pc(E) ppdev(E) lp(E) ramoops(E) parport(E) pstore_blk(E) reed_solomon(E) pstore_zone(E) efi_pstore(E) ip_tables(E) x_tables(E) autofs4(E) btrfs(E) blake2b_generic(E) zstd_compress(E) raid10(E) raid456(E) async_raid6_recov(E) async_memcpy(E) async_pq(E) async_xor(E) async_tx(E) xor(E) rndis_host(E) cdc_ether(E) usbnet(E) mii(E) raid6_pq(E) libcrc32c(E) raid1(E) raid0(E) multipath(E) linear(E) dm_mirror(E) dm_region_hash(E) dm_log(E) hid_generic(E) usbhid(E) hid(E) crct10dif_pclmul(E) crc32_pclmul(E) ghash_clmulni_intel(E) aesni_intel(E) crypto_simd(E) cryptd(E) intel_pmt(E) ast(E) drm_vram_helper(E) drm_ttm_helper(E) ttm(E) nvme(E) drm_kms_helper(E) syscopyarea(E) sysfillrect(E) sysimgblt(E) fb_sys_fops(E) cec(E) rc_core(E) i2c_i801(E) igb(E) ahci(E) i2c_smbus(E) nvme_core(E) drm(E) dca(E) libahci(E) i2c_ismt(E) i2c_algo_bit(E) xhci_pci(E) xhci_pci_renesas(E) wmi(E) pinctrl_emmitsburg(E)
[ 2705.498436] CPU: 20 PID: 309 Comm: kswapd1 Tainted: G      D    OE     5.15.19-htmm #4
[ 2705.498438] Hardware name: Supermicro SYS-511E-WR/X13SEW-F, BIOS 1.0 09/26/2022
[ 2705.498439] RIP: 0010:do_exit+0x49/0xad0
[ 2705.498443] Code: 83 ec 30 65 48 8b 04 25 28 00 00 00 48 89 45 d0 31 c0 48 8b 83 80 0c 00 00 48 85 c0 74 0e 48 8b 10 48 39 d0 0f 84 b0 04 00 00 <0f> 0b 65 8b 0d be 79 f6 7e 89 c8 25 00 ff ff 00 89 45 ac 0f 85 5f
[ 2705.498445] RSP: 0018:ffffc9000e63fef0 EFLAGS: 00010016
[ 2705.498447] RAX: ffffc9000e63fc20 RBX: ffff888119e25d00 RCX: 0000000000000027
[ 2705.498448] RDX: ffff8881353b7e48 RSI: ffffc9000e63f440 RDI: 0000000000000009
[ 2705.498450] RBP: ffffc9000e63ff48 R08: ffff889fffd20980 R09: 0000000000000001
[ 2705.498451] R10: 0000000000ffff0a R11: 000000000000000f R12: 0000000000000009
[ 2705.498453] R13: 0000000000000009 R14: ffffc9000e63f7a8 R15: 0000000000000046
[ 2705.498454] FS:  0000000000000000(0000) GS:ffff889fffd00000(0000) knlGS:0000000000000000
[ 2705.498456] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 2705.498458] CR2: 000000000000000c CR3: 00000002141f8001 CR4: 0000000000770ee0
[ 2705.498460] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 2705.498461] DR3: 0000000000000000 DR6: 00000000fffe07f0 DR7: 0000000000000400
[ 2705.498462] PKRU: 55555554
[ 2705.498464] Call Trace:
[ 2705.498465]  <TASK>
[ 2705.498466]  ? kthread+0x127/0x150
[ 2705.498470]  rewind_stack_do_exit+0x17/0x20
[ 2705.498475]  </TASK>
[ 2705.498476] ---[ end trace 49476d92555e0dc2 ]---