xlab-uiuc / memtis

Tiered memory management
0 stars 0 forks source link

kernel null pointer dereference error while running liblinear #1

Open yulistic opened 6 months ago

yulistic commented 6 months ago

The following is the dmesg output.

[ 3735.534244] echo (3297): drop_caches: 3
[ 3794.127492] BUG: kernel NULL pointer dereference, address: 000000000000000c
[ 3794.127499] #PF: supervisor read access in kernel mode
[ 3794.127501] #PF: error_code(0x0000) - not-present page
[ 3794.127503] PGD 0 P4D 0
[ 3794.127506] Oops: 0000 [#1] SMP NOPTI
[ 3794.127510] CPU: 15 PID: 309 Comm: kswapd1 Tainted: G           OE     5.15.19-htmm #4
[ 3794.127513] Hardware name: Supermicro SYS-511E-WR/X13SEW-F, BIOS 1.0 09/26/2022
[ 3794.127515] RIP: 0010:remove_migration_pte+0x3d4/0x6e0
[ 3794.127522] Code: 80 48 2b 15 56 f8 38 01 48 01 ca 48 8b 35 3c f8 38 01 25 f8 0f 00 00 48 c1 ea 0c 48 c1 e2 06 48 03 44 16 18 0f 84 11 fd ff ff <66> 83 78 04 00 0f 85 06 fd ff ff 4c 89 ff ba 00 10 00 00 48 29 f7
[ 3794.127524] RSP: 0018:ffffc9000e63f850 EFLAGS: 00010202
[ 3794.127527] RAX: 0000000000000008 RBX: ffffea00848c8040 RCX: ffff88a118577008
[ 3794.127529] RDX: 0000000082615dc0 RSI: ffffea0000000000 RDI: 000000000000001b
[ 3794.127530] RBP: ffffc9000e63f8c0 R08: ffffff8000000000 R09: 0000008000000000
[ 3794.127532] R10: 0400000000000080 R11: 0000000000000067 R12: 0000000000000001
[ 3794.127533] R13: ffff88818c5e1838 R14: ffffc9000e63f938 R15: ffffea00848c8040
[ 3794.127535] FS:  0000000000000000(0000) GS:ffff889fffbc0000(0000) knlGS:0000000000000000
[ 3794.127537] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 3794.127538] CR2: 000000000000000c CR3: 0000000115e0c005 CR4: 0000000000770ee0
[ 3794.127540] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 3794.127541] DR3: 0000000000000000 DR6: 00000000fffe07f0 DR7: 0000000000000400
[ 3794.127543] PKRU: 55555554
[ 3794.127544] Call Trace:
[ 3794.127546]  <TASK>
[ 3794.127550]  rmap_walk_anon+0xe8/0x260
[ 3794.127556]  rmap_walk_locked+0x28/0x40
[ 3794.127559]  remove_migration_ptes+0x5e/0x80
[ 3794.127562]  ? trace_event_raw_event_mm_migrate_pages_start+0xb0/0xb0
[ 3794.127566]  remap_page+0x52/0x80
[ 3794.127568]  split_huge_page_to_list+0x8a9/0xc40
[ 3794.127573]  shrink_page_list+0xdb3/0xf00
[ 3794.127578]  shrink_inactive_list+0x1ff/0x480
[ 3794.127581]  shrink_lruvec+0x439/0x720
[ 3794.127585]  shrink_node+0x2c6/0x760
[ 3794.127588]  balance_pgdat+0x36a/0x750
[ 3794.127591]  kswapd+0x202/0x3a0
[ 3794.127594]  ? wait_woken+0x60/0x60
[ 3794.127598]  ? balance_pgdat+0x750/0x750
[ 3794.127600]  kthread+0x127/0x150
[ 3794.127604]  ? set_kthread_struct+0x40/0x40
[ 3794.127608]  ret_from_fork+0x1f/0x30
[ 3794.127613]  </TASK>
[ 3794.127614] Modules linked in: xt_CHECKSUM(E) xt_MASQUERADE(E) xt_conntrack(E) ipt_REJECT(E) nf_reject_ipv4(E) xt_tcpudp(E) nft_compat(E) nft_chain_nat(E) nf_nat(E) nf_conntrack(E) nf_defrag_ipv6(E) nf_defrag_ipv4(E) nft_counter(E) nf_tables(E) nfnetlink(E) bridge(E) stp(E) llc(E) rpcsec_gss_krb5(E) auth_rpcgss(E) nfsv4(E) nfs(E) lockd(E) grace(E) fscache(E) netfs(E) nvme_fabrics(E) socwatch2_15(OE) vtsspp(OE) sep5(OE) socperf3(OE) pax(OE) mlx5_ib(E) ib_uverbs(E) ib_core(E) intel_rapl_msr(E) intel_rapl_common(E) i10nm_edac(E) nfit(E) x86_pkg_temp_thermal(E) intel_powerclamp(E) coretemp(E) kvm_intel(E) sunrpc(E) binfmt_misc(E) mlx5_core(E) kvm(E) mlxfw(E) nls_iso8859_1(E) psample(E) rapl(E) isst_if_mbox_pci(E) joydev(E) tls(E) pci_hyperv_intf(E) isst_if_mmio(E) isst_if_common(E) pmt_telemetry(E) pmt_crashlog(E) pmt_class(E) idxd(E) idxd_bus(E) input_leds(E) mei_me(E) mei(E) ipmi_ssif(E) acpi_ipmi(E) ipmi_si(E) ipmi_devintf(E) ipmi_msghandler(E) acpi_power_meter(E) acpi_pad(E) mac_hid(E)
[ 3794.127666]  sch_fq_codel(E) msr(E) parport_pc(E) ppdev(E) lp(E) parport(E) ramoops(E) reed_solomon(E) pstore_blk(E) pstore_zone(E) efi_pstore(E) ip_tables(E) x_tables(E) autofs4(E) btrfs(E) blake2b_generic(E) zstd_compress(E) raid10(E) raid456(E) async_raid6_recov(E) async_memcpy(E) async_pq(E) async_xor(E) async_tx(E) xor(E) rndis_host(E) cdc_ether(E) usbnet(E) mii(E) raid6_pq(E) libcrc32c(E) raid1(E) raid0(E) multipath(E) linear(E) dm_mirror(E) dm_region_hash(E) dm_log(E) hid_generic(E) usbhid(E) hid(E) ast(E) drm_vram_helper(E) drm_ttm_helper(E) ttm(E) drm_kms_helper(E) syscopyarea(E) sysfillrect(E) sysimgblt(E) fb_sys_fops(E) crct10dif_pclmul(E) cec(E) crc32_pclmul(E) rc_core(E) ghash_clmulni_intel(E) nvme(E) ahci(E) aesni_intel(E) igb(E) crypto_simd(E) dca(E) i2c_i801(E) xhci_pci(E) cryptd(E) intel_pmt(E) drm(E) nvme_core(E) i2c_smbus(E) libahci(E) i2c_ismt(E) i2c_algo_bit(E) xhci_pci_renesas(E) wmi(E) pinctrl_emmitsburg(E)
[ 3794.127724] CR2: 000000000000000c
[ 3794.127726] ---[ end trace 249316dfaff24c69 ]---
[ 3794.138587] RIP: 0010:remove_migration_pte+0x3d4/0x6e0
[ 3794.138591] Code: 80 48 2b 15 56 f8 38 01 48 01 ca 48 8b 35 3c f8 38 01 25 f8 0f 00 00 48 c1 ea 0c 48 c1 e2 06 48 03 44 16 18 0f 84 11 fd ff ff <66> 83 78 04 00 0f 85 06 fd ff ff 4c 89 ff ba 00 10 00 00 48 29 f7
[ 3794.138593] RSP: 0018:ffffc9000e63f850 EFLAGS: 00010202
[ 3794.138595] RAX: 0000000000000008 RBX: ffffea00848c8040 RCX: ffff88a118577008
[ 3794.138597] RDX: 0000000082615dc0 RSI: ffffea0000000000 RDI: 000000000000001b
[ 3794.138598] RBP: ffffc9000e63f8c0 R08: ffffff8000000000 R09: 0000008000000000
[ 3794.138600] R10: 0400000000000080 R11: 0000000000000067 R12: 0000000000000001
[ 3794.138601] R13: ffff88818c5e1838 R14: ffffc9000e63f938 R15: ffffea00848c8040
[ 3794.138603] FS:  0000000000000000(0000) GS:ffff889fffbc0000(0000) knlGS:0000000000000000
[ 3794.138605] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 3794.138607] CR2: 000000000000000c CR3: 0000000115e0c005 CR4: 0000000000770ee0
[ 3794.138608] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 3794.138610] DR3: 0000000000000000 DR6: 00000000fffe07f0 DR7: 0000000000000400
[ 3794.138611] PKRU: 55555554
[ 3794.138646] ------------[ cut here ]------------
[ 3794.138647] WARNING: CPU: 15 PID: 309 at kernel/exit.c:742 do_exit+0x49/0xad0
[ 3794.138653] Modules linked in: xt_CHECKSUM(E) xt_MASQUERADE(E) xt_conntrack(E) ipt_REJECT(E) nf_reject_ipv4(E) xt_tcpudp(E) nft_compat(E) nft_chain_nat(E) nf_nat(E) nf_conntrack(E) nf_defrag_ipv6(E) nf_defrag_ipv4(E) nft_counter(E) nf_tables(E) nfnetlink(E) bridge(E) stp(E) llc(E) rpcsec_gss_krb5(E) auth_rpcgss(E) nfsv4(E) nfs(E) lockd(E) grace(E) fscache(E) netfs(E) nvme_fabrics(E) socwatch2_15(OE) vtsspp(OE) sep5(OE) socperf3(OE) pax(OE) mlx5_ib(E) ib_uverbs(E) ib_core(E) intel_rapl_msr(E) intel_rapl_common(E) i10nm_edac(E) nfit(E) x86_pkg_temp_thermal(E) intel_powerclamp(E) coretemp(E) kvm_intel(E) sunrpc(E) binfmt_misc(E) mlx5_core(E) kvm(E) mlxfw(E) nls_iso8859_1(E) psample(E) rapl(E) isst_if_mbox_pci(E) joydev(E) tls(E) pci_hyperv_intf(E) isst_if_mmio(E) isst_if_common(E) pmt_telemetry(E) pmt_crashlog(E) pmt_class(E) idxd(E) idxd_bus(E) input_leds(E) mei_me(E) mei(E) ipmi_ssif(E) acpi_ipmi(E) ipmi_si(E) ipmi_devintf(E) ipmi_msghandler(E) acpi_power_meter(E) acpi_pad(E) mac_hid(E)
[ 3794.138697]  sch_fq_codel(E) msr(E) parport_pc(E) ppdev(E) lp(E) parport(E) ramoops(E) reed_solomon(E) pstore_blk(E) pstore_zone(E) efi_pstore(E) ip_tables(E) x_tables(E) autofs4(E) btrfs(E) blake2b_generic(E) zstd_compress(E) raid10(E) raid456(E) async_raid6_recov(E) async_memcpy(E) async_pq(E) async_xor(E) async_tx(E) xor(E) rndis_host(E) cdc_ether(E) usbnet(E) mii(E) raid6_pq(E) libcrc32c(E) raid1(E) raid0(E) multipath(E) linear(E) dm_mirror(E) dm_region_hash(E) dm_log(E) hid_generic(E) usbhid(E) hid(E) ast(E) drm_vram_helper(E) drm_ttm_helper(E) ttm(E) drm_kms_helper(E) syscopyarea(E) sysfillrect(E) sysimgblt(E) fb_sys_fops(E) crct10dif_pclmul(E) cec(E) crc32_pclmul(E) rc_core(E) ghash_clmulni_intel(E) nvme(E) ahci(E) aesni_intel(E) igb(E) crypto_simd(E) dca(E) i2c_i801(E) xhci_pci(E) cryptd(E) intel_pmt(E) drm(E) nvme_core(E) i2c_smbus(E) libahci(E) i2c_ismt(E) i2c_algo_bit(E) xhci_pci_renesas(E) wmi(E) pinctrl_emmitsburg(E)
[ 3794.138743] CPU: 15 PID: 309 Comm: kswapd1 Tainted: G      D    OE     5.15.19-htmm #4
[ 3794.138745] Hardware name: Supermicro SYS-511E-WR/X13SEW-F, BIOS 1.0 09/26/2022
[ 3794.138747] RIP: 0010:do_exit+0x49/0xad0
[ 3794.138750] Code: 83 ec 30 65 48 8b 04 25 28 00 00 00 48 89 45 d0 31 c0 48 8b 83 80 0c 00 00 48 85 c0 74 0e 48 8b 10 48 39 d0 0f 84 b0 04 00 00 <0f> 0b 65 8b 0d be 79 f6 7e 89 c8 25 00 ff ff 00 89 45 ac 0f 85 5f
[ 3794.138752] RSP: 0018:ffffc9000e63fef0 EFLAGS: 00010012
[ 3794.138754] RAX: ffffc9000e63fc20 RBX: ffff88810f2a0000 RCX: 0000000000000027
[ 3794.138756] RDX: ffff8881361c03c8 RSI: ffffc9000e63f440 RDI: 0000000000000009
[ 3794.138757] RBP: ffffc9000e63ff48 R08: ffff889fffbe0980 R09: 0000000000000001
[ 3794.138759] R10: 0000000000ffff0a R11: 000000000000000f R12: 0000000000000009
[ 3794.138760] R13: 0000000000000009 R14: ffffc9000e63f7a8 R15: 0000000000000046
[ 3794.138762] FS:  0000000000000000(0000) GS:ffff889fffbc0000(0000) knlGS:0000000000000000
[ 3794.138763] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 3794.138765] CR2: 000000000000000c CR3: 0000000115e0c005 CR4: 0000000000770ee0
[ 3794.138767] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 3794.138768] DR3: 0000000000000000 DR6: 00000000fffe07f0 DR7: 0000000000000400
[ 3794.138769] PKRU: 55555554
[ 3794.138771] Call Trace:
[ 3794.138771]  <TASK>
[ 3794.138773]  ? kthread+0x127/0x150
[ 3794.138777]  rewind_stack_do_exit+0x17/0x20
[ 3794.138781]  </TASK>
[ 3794.138783] ---[ end trace 249316dfaff24c6a ]---