openzfs / zfs

OpenZFS on Linux and FreeBSD
https://openzfs.github.io/openzfs-docs
Other
10.55k stars 1.74k forks source link

BUG: divide error: 0000 [#1] PREEMPT SMP NOPTI #11382

Closed reactormonk closed 2 years ago

reactormonk commented 3 years ago

System information

Type Version/Name
Distribution Name Archlinux
Distribution Version 2.0.0_5.9.14.arch1.1-1
Linux Kernel 5.9.14.arch1.1
Architecture x86_64
ZFS Version 2.0.0
SPL Version 2.0.0

Describe the problem you're observing

zfs module kernel panic

Describe how to reproduce the problem

No clue, really.

Include any warning/errors/backtraces from the system logs

[154752.104314] divide error: 0000 [#1] PREEMPT SMP NOPTI
[154752.104320] CPU: 5 PID: 1097 Comm: txg_sync Tainted: P    B   W  OE     5.9.14-arch1-1 #1
[154752.104322] Hardware name: System manufacturer System Product Name/PRIME B350-PLUS, BIOS 5407 12/31/2019
[154752.104399] RIP: 0010:spa_taskq_dispatch_ent+0xb0/0xc0 [zfs]
[154752.104402] Code: c7 04 24 00 00 00 00 48 c7 44 24 08 00 00 00 00 e8 a5 33 1a d6 8b 8b 78 01 00 00 31 d2 48 69 04 24 00 ca 9a 3b 48 03 44 24 08 <48> f7 f1 49 8b 3c d7 eb 9a e8 c2 66 aa d6 66 90 0f 1f 44 00 00 41
[154752.104405] RSP: 0018:ffffb16fcf773ba0 EFLAGS: 00010206
[154752.104408] RAX: 00008cbf0a2f79c0 RBX: ffff9519c2aa0480 RCX: 0000000000000000
[154752.104410] RDX: 0000000000000000 RSI: 00000000008a4312 RDI: 00000000058887f4
[154752.104412] RBP: ffffffffc1bea8f0 R08: 000809bbe14315c8 R09: 0000000000000000
[154752.104414] R10: 0000000000000005 R11: 0000000000000005 R12: ffff9519a3f71d40
[154752.104416] R13: 0000000000000000 R14: ffff9519a3f72198 R15: 0000000000000000
[154752.104418] FS:  0000000000000000(0000) GS:ffff9519fe940000(0000) knlGS:0000000000000000
[154752.104420] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[154752.104422] CR2: 00000000036cd250 CR3: 00000006574ee000 CR4: 00000000003506e0
[154752.104423] Call Trace:
[154752.104482]  zio_issue_async+0x52/0x90 [zfs]
[154752.104535]  zio_nowait+0xc5/0x1e0 [zfs]
[154752.104579]  dmu_objset_sync+0x411/0x570 [zfs]
[154752.104624]  dsl_dataset_sync+0x6d/0x270 [zfs]
[154752.104673]  dsl_pool_sync+0xb5/0x4c0 [zfs]
[154752.104721]  spa_sync+0x56d/0xfc0 [zfs]
[154752.104727]  ? _raw_spin_lock+0x13/0x30
[154752.104776]  ? spa_txg_history_init_io+0x112/0x120 [zfs]
[154752.104825]  txg_sync_thread+0x2dc/0x4a0 [zfs]
[154752.104875]  ? txg_register_callbacks+0xa0/0xa0 [zfs]
[154752.104884]  ? __thread_exit+0x20/0x20 [spl]
[154752.104889]  thread_generic_wrapper+0x6f/0x80 [spl]
[154752.104895]  kthread+0x142/0x160
[154752.104899]  ? __kthread_bind_mask+0x60/0x60
[154752.104904]  ret_from_fork+0x22/0x30
[154752.104906] Modules linked in: uinput uas usb_storage snd_seq_dummy snd_seq rfcomm nfnetlink cmac algif_hash algif_skcipher af_alg bnep ext4 mbcache jbd2 dm_crypt cbc encrypted_keys trusted tpm loop zfs(POE) zunicode(POE) zzstd(OE) eeepc_wmi asus_wmi battery sparse_keymap wmi_bmof zlua(OE) uvcvideo snd_usb_audio videobuf2_vmalloc zavl(POE) videobuf2_memops snd_usbmidi_lib videobuf2_v4l2 nls_iso8859_1 videobuf2_common icp(POE) nls_cp437 snd_rawmidi videodev vfat snd_seq_device amdgpu edac_mce_amd fat hid_lenovo mc kvm_amd btusb snd_hda_codec_realtek kvm btrtl btbcm snd_hda_codec_generic btintel gpu_sched ledtrig_audio bluetooth snd_hda_codec_hdmi snd_hda_intel i2c_algo_bit irqbypass snd_intel_dspcfg crct10dif_pclmul ttm crc32_pclmul snd_hda_codec ghash_clmulni_intel aesni_intel drm_kms_helper snd_hda_core ecdh_generic xpad crypto_simd rfkill cryptd glue_helper ff_memless joydev cec r8169 mousedev snd_hwdep ecc crc16 input_leds snd_pcm rc_core realtek snd_timer syscopyarea mdio_devres
[154752.104959]  sysfillrect of_mdio snd sysimgblt rapl fb_sys_fops fixed_phy pcspkr soundcore libphy ccp sp5100_tco k10temp i2c_piix4 rng_core wmi evdev pinctrl_amd gpio_amdpt mac_hid acpi_cpufreq zcommon(POE) znvpair(POE) spl(OE) vboxnetflt(OE) vboxnetadp(OE) vboxdrv(OE) dm_mod drm usbip_host usbip_core fuse crypto_user agpgart ip_tables x_tables hid_steam hid_generic usbhid hid btrfs blake2b_generic libcrc32c crc32c_generic xor raid6_pq crc32c_intel xhci_pci xhci_pci_renesas xhci_hcd
[154752.105082] ---[ end trace 208addf76b0b6a0d ]---
[154752.105130] RIP: 0010:spa_taskq_dispatch_ent+0xb0/0xc0 [zfs]
[154752.105133] Code: c7 04 24 00 00 00 00 48 c7 44 24 08 00 00 00 00 e8 a5 33 1a d6 8b 8b 78 01 00 00 31 d2 48 69 04 24 00 ca 9a 3b 48 03 44 24 08 <48> f7 f1 49 8b 3c d7 eb 9a e8 c2 66 aa d6 66 90 0f 1f 44 00 00 41
[154752.105135] RSP: 0018:ffffb16fcf773ba0 EFLAGS: 00010206
[154752.105137] RAX: 00008cbf0a2f79c0 RBX: ffff9519c2aa0480 RCX: 0000000000000000
[154752.105139] RDX: 0000000000000000 RSI: 00000000008a4312 RDI: 00000000058887f4
[154752.105141] RBP: ffffffffc1bea8f0 R08: 000809bbe14315c8 R09: 0000000000000000
[154752.105143] R10: 0000000000000005 R11: 0000000000000005 R12: ffff9519a3f71d40
[154752.105144] R13: 0000000000000000 R14: ffff9519a3f72198 R15: 0000000000000000
[154752.105147] FS:  0000000000000000(0000) GS:ffff9519fe940000(0000) knlGS:0000000000000000
[154752.105149] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[154752.105150] CR2: 00000000036cd250 CR3: 00000006574ee000 CR4: 00000000003506e0
reactormonk commented 3 years ago

I think this is a similar one, triggered by a zfs send/receive

17032.567735] divide error: 0000 [#1] PREEMPT SMP NOPTI
[17032.567743] CPU: 0 PID: 2979329 Comm: z_wr_int Tainted: P           OE     5.10.5-arch1-1 #1
[17032.567745] Hardware name: System manufacturer System Product Name/PRIME B350-PLUS, BIOS 5407 12/31/2019
[17032.567827] RIP: 0010:spa_taskq_dispatch_ent+0xb0/0xc0 [zfs]
[17032.567833] Code: c7 04 24 00 00 00 00 48 c7 44 24 08 00 00 00 00 e8 15 f9 b7 e5 8b 8b 78 01 00 00 31 d2 48 69 04 24 00 ca 9a 3b 48 03 44 24 08 <48> f7 f1 49 8b 3c d7 eb 9a e8 52 03 4d e6 66 90 0f 1f 44 00 00 41
[17032.567835] RSP: 0018:ffffa6908fb3bcf8 EFLAGS: 00010206
[17032.567839] RAX: 00000f7da4431a09 RBX: ffff8e93047ac480 RCX: 0000000000000000
[17032.567840] RDX: 0000000000000000 RSI: 0000000000ba304d RDI: 0000000000970036
[17032.567842] RBP: ffffffffc1a14030 R08: 0011bcecc974d232 R09: 0000000008000000
[17032.567843] R10: 0000000000000000 R11: 0000000000000000 R12: ffff8e94e3f71d40
[17032.567845] R13: 0000000008000000 R14: ffff8e94e3f72198 R15: 0000000000000000
[17032.567847] FS:  0000000000000000(0000) GS:ffff8e953e800000(0000) knlGS:0000000000000000
[17032.567849] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[17032.567850] CR2: 00002d9efffb6000 CR3: 00000002b9aca000 CR4: 00000000003506f0
[17032.567852] Call Trace:
[17032.567913]  zio_taskq_dispatch+0x61/0xa0 [zfs]
[17032.567969]  zio_done+0xa45/0x1150 [zfs]
[17032.568017]  ? dsl_pool_undirty_space.part.0+0x90/0xb0 [zfs]
[17032.568070]  zio_execute+0x81/0x120 [zfs]
[17032.568079]  taskq_thread+0x28e/0x4b0 [spl]
[17032.568086]  ? wake_up_q+0xa0/0xa0
[17032.568141]  ? zio_vdev_io_done+0x210/0x210 [zfs]
[17032.568149]  ? taskq_thread_spawn+0x50/0x50 [spl]
[17032.568153]  kthread+0x133/0x150
[17032.568156]  ? __kthread_bind_mask+0x60/0x60
[17032.568160]  ret_from_fork+0x22/0x30
[17032.568163] Modules linked in: uinput rfcomm nfnetlink cmac algif_hash algif_skcipher af_alg bnep ext4 mbcache jbd2 dm_crypt cbc encrypted_keys trusted tpm loop zfs(POE) nls_iso8859_1 vfat snd_hda_codec_realtek fat snd_hda_codec_generic amdgpu hid_lenovo ledtrig_audio snd_hda_codec_hdmi snd_hda_intel snd_intel_dspcfg soundwire_intel soundwire_generic_allocation soundwire_cadence r8169 mousedev realtek snd_hda_codec edac_mce_amd btusb eeepc_wmi mdio_devres btrtl snd_hda_core kvm_amd btbcm libphy hid_steam asus_wmi snd_usb_audio soundwire_bus btintel joydev sparse_keymap gpu_sched wmi_bmof kvm snd_soc_core video bluetooth i2c_algo_bit snd_usbmidi_lib snd_hwdep ttm snd_compress irqbypass snd_rawmidi crct10dif_pclmul ac97_bus snd_seq_device crc32_pclmul mc snd_pcm_dmaengine ghash_clmulni_intel ecdh_generic drm_kms_helper snd_pcm aesni_intel snd_timer cec crypto_simd zunicode(POE) rfkill ccp snd xpad cryptd syscopyarea glue_helper sysfillrect sp5100_tco sysimgblt ecc crc16 zzstd(OE) rapl
[17032.568233]  ff_memless usbhid soundcore pcspkr k10temp i2c_piix4 fb_sys_fops rng_core wmi gpio_amdpt gpio_generic pinctrl_amd acpi_cpufreq mac_hid zlua(OE) zavl(POE) icp(POE) zcommon(POE) znvpair(POE) spl(OE) vboxnetflt(OE) vboxnetadp(OE) vboxdrv(OE) dm_mod usbip_host drm usbip_core crypto_user fuse agpgart bpf_preload ip_tables x_tables btrfs blake2b_generic libcrc32c crc32c_generic xor raid6_pq crc32c_intel xhci_pci xhci_pci_renesas
[17032.568270] ---[ end trace af189d6799e3213b ]---
[17032.570061] ------------[ cut here ]------------
[17032.570068] WARNING: CPU: 0 PID: 2979329 at kernel/rcu/tree_plugin.h:297 rcu_note_context_switch+0x42/0x480
[17032.570070] Modules linked in: uinput rfcomm nfnetlink cmac algif_hash algif_skcipher af_alg bnep ext4 mbcache jbd2 dm_crypt cbc encrypted_keys trusted tpm loop zfs(POE) nls_iso8859_1 vfat snd_hda_codec_realtek fat snd_hda_codec_generic amdgpu hid_lenovo ledtrig_audio snd_hda_codec_hdmi snd_hda_intel snd_intel_dspcfg soundwire_intel soundwire_generic_allocation soundwire_cadence r8169 mousedev realtek snd_hda_codec edac_mce_amd btusb eeepc_wmi mdio_devres btrtl snd_hda_core kvm_amd btbcm libphy hid_steam asus_wmi snd_usb_audio soundwire_bus btintel joydev sparse_keymap gpu_sched wmi_bmof kvm snd_soc_core video bluetooth i2c_algo_bit snd_usbmidi_lib snd_hwdep ttm snd_compress irqbypass snd_rawmidi crct10dif_pclmul ac97_bus snd_seq_device crc32_pclmul mc snd_pcm_dmaengine ghash_clmulni_intel ecdh_generic drm_kms_helper snd_pcm aesni_intel snd_timer cec crypto_simd zunicode(POE) rfkill ccp snd xpad cryptd syscopyarea glue_helper sysfillrect sp5100_tco sysimgblt ecc crc16 zzstd(OE) rapl
[17032.570890]  ff_memless usbhid soundcore pcspkr k10temp i2c_piix4 fb_sys_fops rng_core wmi gpio_amdpt gpio_generic pinctrl_amd acpi_cpufreq mac_hid zlua(OE) zavl(POE) icp(POE) zcommon(POE) znvpair(POE) spl(OE) vboxnetflt(OE) vboxnetadp(OE) vboxdrv(OE) dm_mod usbip_host drm usbip_core crypto_user fuse agpgart bpf_preload ip_tables x_tables btrfs blake2b_generic libcrc32c crc32c_generic xor raid6_pq crc32c_intel xhci_pci xhci_pci_renesas
[17032.570925] CPU: 0 PID: 2979329 Comm: z_wr_int Tainted: P      D    OE     5.10.5-arch1-1 #1
[17032.570926] Hardware name: System manufacturer System Product Name/PRIME B350-PLUS, BIOS 5407 12/31/2019
[17032.570930] RIP: 0010:rcu_note_context_switch+0x42/0x480
[17032.570934] Code: c3 40 cf 02 00 65 48 03 1d e3 ef af 58 0f 1f 44 00 00 40 84 ed 75 15 65 48 8b 04 25 c0 7b 01 00 8b 90 b0 03 00 00 85 d2 7e 02 <0f> 0b 65 48 8b 04 25 c0 7b 01 00 8b 80 b0 03 00 00 85 c0 7e 0a 41
[17032.570936] RSP: 0018:ffffa6908fb3b740 EFLAGS: 00010002
[17032.570938] RAX: ffff8e8ed3e2bd80 RBX: ffff8e953e82cf40 RCX: 0000000000000000
[17032.570940] RDX: 0000000000000001 RSI: ffffffffa7e7dc9b RDI: 0000000000000000
[17032.570941] RBP: 0000000000000000 R08: 0000000000000001 R09: 0000000000000400
[17032.570942] R10: abcc77118461cefd R11: ffffa6908fb3ba2f R12: 000000000002c180
[17032.570943] R13: ffff8e8ed3e2bd80 R14: ffff8e8ed3e2bd80 R15: ffff8e953e82c180
[17032.570946] FS:  0000000000000000(0000) GS:ffff8e953e800000(0000) knlGS:0000000000000000
[17032.570948] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[17032.570949] CR2: 00002d9efffb6000 CR3: 00000002b9aca000 CR4: 00000000003506f0
[17032.570951] Call Trace:
[17032.570967]  __schedule+0xae/0x810
[17032.570971]  schedule+0x5b/0xc0
[17032.570974]  schedule_timeout+0x11c/0x160
[17032.570978]  wait_for_completion+0x9e/0x100
[17032.570986]  virt_efi_query_variable_info+0x141/0x150
[17032.570994]  efi_query_variable_store+0x7b/0x1e0
[17032.571000]  ? vsnprintf+0x6e/0x4f0
[17032.571003]  efivar_entry_set_safe+0xbd/0x210
[17032.571007]  efi_pstore_write+0x12d/0x1a0
[17032.571014]  pstore_dump+0x1b3/0x2f0
[17032.571020]  kmsg_dump+0xa8/0xd0
[17032.571024]  oops_end+0x61/0xd0
[17032.571028]  do_trap+0x8d/0x110
[17032.571102]  ? spa_taskq_dispatch_ent+0xb0/0xc0 [zfs]
[17032.571106]  do_error_trap+0x64/0xa0
[17032.571156]  ? spa_taskq_dispatch_ent+0xb0/0xc0 [zfs]
[17032.571161]  exc_divide_error+0x36/0x50
[17032.571211]  ? spa_taskq_dispatch_ent+0xb0/0xc0 [zfs]
[17032.571214]  asm_exc_divide_error+0x12/0x20
[17032.571265] RIP: 0010:spa_taskq_dispatch_ent+0xb0/0xc0 [zfs]
[17032.571267] Code: c7 04 24 00 00 00 00 48 c7 44 24 08 00 00 00 00 e8 15 f9 b7 e5 8b 8b 78 01 00 00 31 d2 48 69 04 24 00 ca 9a 3b 48 03 44 24 08 <48> f7 f1 49 8b 3c d7 eb 9a e8 52 03 4d e6 66 90 0f 1f 44 00 00 41
[17032.571269] RSP: 0018:ffffa6908fb3bcf8 EFLAGS: 00010206
[17032.571271] RAX: 00000f7da4431a09 RBX: ffff8e93047ac480 RCX: 0000000000000000
[17032.571272] RDX: 0000000000000000 RSI: 0000000000ba304d RDI: 0000000000970036
[17032.571274] RBP: ffffffffc1a14030 R08: 0011bcecc974d232 R09: 0000000008000000
[17032.571276] R10: 0000000000000000 R11: 0000000000000000 R12: ffff8e94e3f71d40
[17032.571277] R13: 0000000008000000 R14: ffff8e94e3f72198 R15: 0000000000000000
[17032.571330]  ? zio_vdev_io_done+0x210/0x210 [zfs]
[17032.571386]  ? spa_taskq_dispatch_ent+0x9b/0xc0 [zfs]
[17032.571438]  zio_taskq_dispatch+0x61/0xa0 [zfs]
[17032.571493]  zio_done+0xa45/0x1150 [zfs]
[17032.571541]  ? dsl_pool_undirty_space.part.0+0x90/0xb0 [zfs]
[17032.571595]  zio_execute+0x81/0x120 [zfs]
[17032.571603]  taskq_thread+0x28e/0x4b0 [spl]
[17032.571608]  ? wake_up_q+0xa0/0xa0
[17032.571660]  ? zio_vdev_io_done+0x210/0x210 [zfs]
[17032.571665]  ? taskq_thread_spawn+0x50/0x50 [spl]
[17032.571668]  kthread+0x133/0x150
[17032.571670]  ? __kthread_bind_mask+0x60/0x60
[17032.571673]  ret_from_fork+0x22/0x30
[17032.571676] ---[ end trace af189d6799e3213c ]---
stale[bot] commented 2 years ago

This issue has been automatically marked as "stale" because it has not had any activity for a while. It will be closed in 90 days if no further activity occurs. Thank you for your contributions.