Open rincebrain opened 3 years ago
Oh boy, 4.15.0-2-sparc64 actually gave me a stacktrace...
[ 1004.096214] Kernel panic - not syncing: corrupted stack end detected inside scheduler
[ 1004.096214]
[ 1004.218708] CPU: 0 PID: 23350 Comm: spl_system_task Tainted: P O 4.15.0-2-sparc64 #1 Debian 4.15.11-1
[ 1004.356022] Call Trace:
[ 1004.388180] [00000000004668f0] panic+0xd0/0x280
[ 1004.448890] [00000000009f8ccc] switch_to_pc+0x4f8/0x50c
[ 1004.518762] [00000000009f8e9c] _cond_resched+0x3c/0x60
[ 1004.587500] [00000000009fa06c] mutex_lock+0xc/0x40
[ 1004.652797] [00000000108c968c] zio_wait_for_children+0xc/0xc0 [zfs]
[ 1004.736825] [00000000108ca304] zio_vdev_io_done+0x24/0x200 [zfs]
[ 1004.817421] [00000000108cb9b0] zio_execute+0x90/0x100 [zfs]
[ 1004.892274] [000000001088a160] vdev_mirror_io_start+0x100/0x280 [zfs]
[ 1004.978602] [00000000108cd008] zio_vdev_io_start+0x2c8/0x320 [zfs]
[ 1005.061473] [00000000108cf674] zio_nowait+0xb4/0x140 [zfs]
[ 1005.135133] [00000000107d54b8] arc_read+0xb58/0x1140 [zfs]
[ 1005.208761] [00000000107e2c04] dbuf_issue_final_prefetch+0x84/0x100 [zfs]
[ 1005.299546] [00000000107e87d8] dbuf_prefetch_indirect_done+0x1d8/0x200 [zfs]
[ 1005.393751] [00000000107d5cf8] arc_read_done+0x258/0x440 [zfs]
[ 1005.472020] [00000000108d16d0] zio_done+0x470/0xe40 [zfs]
[ 1005.544597] [00000000108cb9b0] zio_execute+0x90/0x100 [zfs]
[ 1005.619062] Press Stop-A (L1-A) from sun keyboard or send break
[ 1005.619062] twice on console to return to the boot prom
[ 1005.765570] ---[ end Kernel panic - not syncing: corrupted stack end detected inside scheduler
[ 1005.765570]
This issue has been automatically marked as "stale" because it has not had any activity for a while. It will be closed in 90 days if no further activity occurs. Thank you for your contributions.
This issue has been automatically marked as "stale" because it has not had any activity for a while. It will be closed in 90 days if no further activity occurs. Thank you for your contributions.
System information
Describe the problem you're observing
Trying to run through ZTS for #12022, found that on vanilla git master (or my patched branch, for that matter), running the whole series of rsend tests will, when it gets to rsend_012_pos, for whatever reason, cause the kernel to crash and burn 100% of the time. (Unhelpfully, it fails to print a stacktrace - the full output to console is reproduced below.)
Sometimes, it's unhappy enough that the watchdog timer doesn't trigger and pressing break twice doesn't work to get back to prom, leaving you to physically power cycle it.
(It seems potentially relevant to mention this is a Netra T1 - so it's possible other Linux/SPARC64 hardware might not suffer from this? IDK, I do not know what's breaking right now.)
Describe how to reproduce the problem
`scripts/zfs-tests.sh -r rsend
Include any warning/errors/backtraces from the system logs
crash output to console:
/proc/cpuinfo