openzfs / spl

A shim layer which adds the core interfaces required for OpenZFS.
https://zfsonlinux.org/
GNU General Public License v2.0
281 stars 180 forks source link

unable to handle kernel NULL pointer dereference at 0000000000000010 at spl_kmem_cache_alloc+0x72/0x8d0 #624

Closed krichter722 closed 6 years ago

krichter722 commented 7 years ago

I just experienced FIrefox and other running application to become unresponsive and hang at I/O (but not all) after changing a browser tab with the following backtrace logged in dmesg:

[19224.970329] BUG: unable to handle kernel NULL pointer dereference at 0000000000000010
[19224.970367] IP: avl_rotation.isra.0+0x52/0x250 [zavl]
[19224.970395] PGD 0 

[19224.970410] Oops: 0000 [#1] SMP
[19224.970421] Modules linked in: xt_CHECKSUM iptable_mangle ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_nat_ipv4 nf_nat libcrc32c nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack ipt_REJECT nf_reject_ipv4 xt_tcpudp bridge stp llc rfcomm msr ebtable_filter ebtables ip6table_filter ip6_tables snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_codec_generic snd_hda_intel snd_hda_codec snd_hda_core snd_hwdep snd_pcm snd_timer snd soundcore iptable_filter pci_stub vboxpci(OE) vboxnetadp(OE) vboxnetflt(OE) bbswitch(OE) bnep vboxdrv(OE) cdc_ether usbnet r8152 btusb btrtl btbcm btintel bluetooth rtsx_usb_ms memstick binfmt_misc uvcvideo videobuf2_vmalloc videobuf2_memops videobuf2_v4l2 videobuf2_core videodev media hid_multitouch zfs(POE) zunicode(POE) zavl(POE) icp(POE) nls_iso8859_1 zcommon(POE)
[19224.970635]  znvpair(POE) spl(OE) intel_rapl x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm irqbypass crct10dif_pclmul crc32_pclmul ghash_clmulni_intel pcbc aesni_intel aes_x86_64 crypto_simd glue_helper cryptd intel_cstate intel_rapl_perf input_leds joydev serio_raw ideapad_laptop sparse_keymap wmi arc4 iwldvm mac80211 iwlwifi cfg80211 mac_hid lpc_ich mei_me mei shpchp ib_iser rdma_cm iw_cm ib_cm ib_core configfs iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi coda parport_pc ppdev lp parport sunrpc ip_tables x_tables autofs4 btrfs xor raid6_pq rtsx_usb_sdmmc rtsx_usb hid_generic usbhid hid psmouse r8169 mii i915 fjes video i2c_algo_bit ahci drm_kms_helper libahci syscopyarea sysfillrect sysimgblt fb_sys_fops drm [last unloaded: nvidia]
[19224.970828] CPU: 4 PID: 13503 Comm: txg_sync Tainted: P           OE   4.10.0-26-generic #30-Ubuntu
[19224.970853] Hardware name: LENOVO 20221/INVALID, BIOS 71CN51WW(V1.21) 07/12/2013
[19224.970874] task: ffff9d6814f42d00 task.stack: ffffc1c50b32c000
[19224.970892] RIP: 0010:avl_rotation.isra.0+0x52/0x250 [zavl]
[19224.970908] RSP: 0018:ffffc1c50b32fb08 EFLAGS: 00010a07
[19224.970924] RAX: 3fffe75963e0ff17 RBX: 0000000000000001 RCX: 0000000000000000
[19224.970943] RDX: 0000000000000001 RSI: ffff9d66a85bb618 RDI: ffff9d68053e9128
[19224.970963] RBP: ffffc1c50b32fb40 R08: 0000000000000001 R09: 0000000000000000
[19224.970983] R10: ffff9d658f83fc5e R11: 00000000ffffffff R12: 0000000000000008
[19224.971003] R13: ffffffffc083f130 R14: ffff9d658f83fc58 R15: ffff9d66a85bb620
[19224.971023] FS:  0000000000000000(0000) GS:ffff9d683f300000(0000) knlGS:0000000000000000
[19224.971045] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[19224.971062] CR2: 0000000000000010 CR3: 0000000066609000 CR4: 00000000001406e0
[19224.971082] Call Trace:
[19224.971096]  ? spl_kmem_cache_alloc+0x72/0x8d0 [spl]
[19224.972124]  avl_remove+0x1d6/0x250 [zavl]
[19224.973161]  metaslab_rt_remove+0x2b/0x60 [zfs]
[19224.974195]  range_tree_add+0x238/0x2d0 [zfs]
[19224.975305]  ? kmem_cache_free+0x1b6/0x1e0
[19224.975332]  ? range_tree_destroy+0x70/0x70 [zfs]
[19224.975365]  range_tree_vacate+0x5c/0xe0 [zfs]
[19224.975395]  ? range_tree_destroy+0x70/0x70 [zfs]
[19224.975443]  metaslab_sync_done+0x11c/0x6d0 [zfs]
[19224.975472]  vdev_sync_done+0x3e/0x70 [zfs]
[19224.975498]  spa_sync+0x641/0xdc0 [zfs]
[19224.975524]  txg_sync_thread+0x2e2/0x4b0 [zfs]
[19224.975549]  ? txg_quiesce_thread+0x3f0/0x3f0 [zfs]
[19224.975553]  thread_generic_wrapper+0x72/0x80 [spl]
[19224.975555]  kthread+0x109/0x140
[19224.975557]  ? __thread_exit+0x20/0x20 [spl]
[19224.975559]  ? kthread_create_on_node+0x60/0x60
[19224.975561]  ret_from_fork+0x2c/0x40
[19224.975562] Code: 41 56 41 55 41 54 4e 8d 24 cd 00 00 00 00 53 41 f7 db 44 29 c1 4e 8d 3c 26 48 83 ec 10 48 8b 46 10 4d 8b 0f 49 89 c2 48 c1 e8 02 <41> 8b 59 10 49 83 e2 f8 83 e0 01 83 e3 03 83 eb 01 41 39 db 0f 
[19224.975576] RIP: avl_rotation.isra.0+0x52/0x250 [zavl] RSP: ffffc1c50b32fb08
[19224.975576] CR2: 0000000000000010
[19224.979856] ---[ end trace f43c5270ef3c1896 ]---

I could reboot from another tty.

experienced with SPL 0.7.0-rc4_5_g7a35f2b4 and ZFS 0.7.0-rc4_77_g682ce104cd on Ubuntu 17.04 amd64 with Linux 4.10.0-26-generic

behlendorf commented 6 years ago

Closing as stale. If it's observed again please open a new issue on the zfs issue tracker.