Open datacore-rm opened 2 years ago
Hmm doesn't ring any immediate bells, and this is pre-OS split too, so harder to connect up. Sean spent hours tracking down this problem: https://github.com/openzfs/zfs/commit/c6d1112bf4125e5a22eb47ceb7b8cee01f0df9a1 but that did not lead to NULL ptr, but rather wrong free.
Thanks. In https://github.com/openzfs/zfs/commit/47b408fcb32cdc81e6e22755191d961718afde8c, _"KM_PUSHPAGE->KMSLEEP when allocating chunks from their own arena". On windows, does KM_PUSHPAGE(can use reserve memory) guarantee allocation success like KM_SLEEP? If you can pls help with the usage scenario/diff for both the flags.
KM_PUSHPAGE can return NULL, KM_SLEEP will only return !NULL.
Thanks. We checked the kmem_cache 'cache_alloc_fail' counter for all the caches and value is 0. There is large free system memory available. So this doesn't seem to indicate any memory alloc failure...Anything else we can check?
Below is zio(dio) structure containing NULL abd linear buffer pointer.
26: kd> .frame 0n12;dv /t /v
0c ffff9f805add3330 fffff80d
76c8a2bc ZFSin!vdev_queue_aggregate+0x3f8 [C:\BuildAgent\work\88cd52027cd63d70\ZFSin\zfs\module\zfs\vdev_queue.c @ 753]
@r15 struct vdev_queue * vq = 0xffffd489`d2b3ff18
Have you come across this issue (may be in openzfs)? The source abd is ABD_FLAG_LINEAR but the abd_u.abd_linear.abd_buf( and hence iter_mapaddr) is NULL.
SYSTEM_THREAD_EXCEPTION_NOT_HANDLED (7e) This is a very common bugcheck. Usually the exception address pinpoints the driver/function that caused the problem. Always note this address as well as the link date of the driver/image that contains this address. Arguments: Arg1: ffffffffc0000005, The exception code that was not handled Arg2: fffff80d76ce7f40, The address that the exception occurred at Arg3: ffff9f805add2fb8, Exception Record Address Arg4: ffff9f805add2800, Context Record Address
Debugging Details:
Page 5c00 not present in the dump file. Type ".hh dbgerr004" for details Page 5c00 not present in the dump file. Type ".hh dbgerr004" for details Page 5c00 not present in the dump file. Type ".hh dbgerr004" for details Page 5c00 not present in the dump file. Type ".hh dbgerr004" for details Page 5c00 not present in the dump file. Type ".hh dbgerr004" for details Page 5c00 not present in the dump file. Type ".hh dbgerr004" for details
KEY_VALUES_STRING: 1
BUGCHECK_CODE: 7e
BUGCHECK_P1: ffffffffc0000005
BUGCHECK_P2: fffff80d76ce7f40
BUGCHECK_P3: ffff9f805add2fb8
BUGCHECK_P4: ffff9f805add2800
EXCEPTION_RECORD: ffff9f805add2fb8 -- (.exr 0xffff9f805add2fb8) ExceptionAddress: fffff80d76ce7f40 (ZFSin!memcpy+0x0000000000000240) ExceptionCode: c0000005 (Access violation) ExceptionFlags: 00000000 NumberParameters: 2 Parameter[0]: 0000000000000000 Parameter[1]: 00000000000003f0 Attempt to read from address 00000000000003f0
CONTEXT: ffff9f805add2800 -- (.cxr 0xffff9f805add2800) rax=ffffd48cf9fa2c00 rbx=0000000000001000 rcx=ffffd48cf9fa2c00 rdx=00002b730605d800 rsi=0000000000000000 rdi=0000000000000400 rip=fffff80d76ce7f40 rsp=ffff9f805add31f8 rbp=ffff9f805add32b0 r8=0000000000000000 r9=0000000000000020 r10=0000000000000400 r11=ffffd48cf9fa2800 r12=0000000000000000 r13=0000000000000000 r14=0000000000000000 r15=0000000000040000 iopl=0 nv up ei pl zr na po nc cs=0010 ss=0018 ds=002b es=002b fs=0053 gs=002b efl=00010246 ZFSin!memcpy+0x240: fffff80d
76ce7f40 f30f6f4411f0 movdqu xmm0,xmmword ptr [rcx+rdx-10h] ds:002b:00000000
000003f0=???????????????????????????????? Resetting default scopePROCESS_NAME: System
READ_ADDRESS: 00000000000003f0
ERROR_CODE: (NTSTATUS) 0xc0000005 - The instruction at 0x%p referenced memory at 0x%p. The memory could not be %s.
EXCEPTION_CODE_STR: c0000005
EXCEPTION_PARAMETER1: 0000000000000000
EXCEPTION_PARAMETER2: 00000000000003f0
EXCEPTION_STR: 0xc0000005
STACK_TEXT:
ffff9f80
5add31f8 fffff80d
76bfd579 : 0000000000000001 00000000
00000000 ffff9f804c9e6180 00000000
00000000 : ZFSin!memcpy+0x240 [minkernel\crts\crtw32\string\amd64\memcpy.asm @ 342] ffff9f805add3200 fffff80d
76bfde8c : 0000000000000001 fffff80d
76cdea5a 0000000000000001 fffff801
b970123c : ZFSin!abd_copy_off_cb+0x9 [C:\BuildAgent\work\88cd52027cd63d70\ZFSin\zfs\module\zfs\abd.c @ 1231] ffff9f805add3230 fffff80d
76bfd561 : ffffd48bf00647f8 ffffd48e
5162d040 ffffd489d2b40610 fffff80d
76cdead4 : ZFSin!abd_iterate_func2+0xf8 [C:\BuildAgent\work\88cd52027cd63d70\ZFSin\zfs\module\zfs\abd.c @ 1212] ffff9f805add32e0 fffff80d
76c89b14 : ffffd48bc236e540 ffffd48d
abb80050 ffffd489d2b40610 ffffd48b
ff9860a0 : ZFSin!abd_copy_off+0x25 [C:\BuildAgent\work\88cd52027cd63d70\ZFSin\zfs\module\zfs\abd.c @ 1251] ffff9f805add3330 fffff80d
76c8a2bc : ffffd489d2b3ff18 00000000
00000000 ffffd48be14f1f00 ffffd489
d2b3ffa0 : ZFSin!vdev_queue_aggregate+0x3f8 [C:\BuildAgent\work\88cd52027cd63d70\ZFSin\zfs\module\zfs\vdev_queue.c @ 753] ffff9f805add33c0 fffff80d
76c8a181 : ffffd48c5165da68 ffffd48c
0fe04861 0000000000000000 ffffd489
d2b3ff18 : ZFSin!vdev_queue_io_to_issue+0xc0 [C:\BuildAgent\work\88cd52027cd63d70\ZFSin\zfs\module\zfs\vdev_queue.c @ 796] ffff9f805add3410 fffff80d
76cd8d33 : ffffd48c5165da68 fffff80d
76d58240 fffff80d76d58240 ffffd489
d2b3d000 : ZFSin!vdev_queue_io_done+0xb5 [C:\BuildAgent\work\88cd52027cd63d70\ZFSin\zfs\module\zfs\vdev_queue.c @ 891] ffff9f805add3440 fffff80d
76cda5b8 : ffffd48c5165da68 00000000
00000001 0000000000200000 fffff80d
76bae64b : ZFSin!zio_vdev_io_done+0x83 [C:\BuildAgent\work\88cd52027cd63d70\ZFSin\zfs\module\zfs\zio.c @ 3593] ffff9f805add3470 fffff80d
76bb06e8 : ffffd489d21b6700 002318ac
02263d58 fffff80d76bb04d0 fffff801
b9aba3cc : ZFSin!__zio_execute+0x1a8 [C:\BuildAgent\work\88cd52027cd63d70\ZFSin\zfs\module\zfs\zio.c @ 1997] ffff9f805add34e0 fffff801
b97a321d : ffffd48a00000007 ffffd48a
2f200800 0000000000000080 fffff80d
76bb04d0 : ZFSin!taskq_thread+0x218 [C:\BuildAgent\work\88cd52027cd63d70\ZFSin\spl\module\spl\spl-taskq.c @ 1612] ffff9f805add3550 fffff801
b97fd886 : ffff9f804c700180 ffffd48a
2f200800 fffff801b97a31dc ffffb281
57056bb0 : nt!PspSystemThreadStartup+0x41 ffff9f805add35a0 00000000
00000000 : ffff9f805add4000 ffff9f80
5adcd000 0000000000000000 00000000
00000000 : nt!KiStartSystemThread+0x16FAULTING_SOURCE_LINE: minkernel\crts\crtw32\string\amd64\memcpy.asm
FAULTING_SOURCE_FILE: minkernel\crts\crtw32\string\amd64\memcpy.asm
FAULTING_SOURCE_LINE_NUMBER: 342
FAULTING_SOURCE_CODE:
No source found for 'minkernel\crts\crtw32\string\amd64\memcpy.asm'
SYMBOL_NAME: ZFSin!memcpy+240
MODULE_NAME: ZFSin
IMAGE_NAME: ZFSin.sys
STACK_COMMAND: .cxr 0xffff9f805add2800 ; kb
BUCKET_ID_FUNC_OFFSET: 240
FAILURE_BUCKET_ID: AV_ZFSin!memcpy
OS_VERSION: 10.0.14393.4651
BUILDLAB_STR: rs1_release
OSPLATFORM_TYPE: x64
OSNAME: Windows 10
FAILURE_ID_HASH: {1c87be51-1431-220e-9e18-388a2f2220e9}
Followup: MachineOwner
26: kd> k
Child-SP RetAddr Call Site
00 ffff9f80
5add1fe8 fffff801
b980dd55 nt!KeBugCheckEx 01 ffff9f805add1ff0 fffff801
b97e5b8f nt!PspSystemThreadStartup$filt$0+0x44 02 ffff9f805add2030 fffff801
b97fe99d nt!_C_specific_handler+0x9f 03 ffff9f805add20a0 fffff801
b974d551 nt!RtlpExecuteHandlerForException+0xd 04 ffff9f805add20d0 fffff801
b974c374 nt!RtlDispatchException+0x421 05 ffff9f805add27d0 fffff801
b9806702 nt!KiDispatchException+0x1e4 06 ffff9f805add2e80 fffff801
b9803746 nt!KiExceptionDispatch+0xc2 07 ffff9f805add3060 fffff80d
76ce7f40 nt!KiPageFault+0x406 08 ffff9f805add31f8 fffff80d
76bfd579 ZFSin!memcpy+0x240 [minkernel\crts\crtw32\string\amd64\memcpy.asm @ 342] 09 ffff9f805add3200 fffff80d
76bfde8c ZFSin!abd_copy_off_cb+0x9 [C:\BuildAgent\work\88cd52027cd63d70\ZFSin\zfs\module\zfs\abd.c @ 1231] 0a ffff9f805add3230 fffff80d
76bfd561 ZFSin!abd_iterate_func2+0xf8 [C:\BuildAgent\work\88cd52027cd63d70\ZFSin\zfs\module\zfs\abd.c @ 1212] 0b ffff9f805add32e0 fffff80d
76c89b14 ZFSin!abd_copy_off+0x25 [C:\BuildAgent\work\88cd52027cd63d70\ZFSin\zfs\module\zfs\abd.c @ 1251] 0c ffff9f805add3330 fffff80d
76c8a2bc ZFSin!vdev_queue_aggregate+0x3f8 [C:\BuildAgent\work\88cd52027cd63d70\ZFSin\zfs\module\zfs\vdev_queue.c @ 753] 0d ffff9f805add33c0 fffff80d
76c8a181 ZFSin!vdev_queue_io_to_issue+0xc0 [C:\BuildAgent\work\88cd52027cd63d70\ZFSin\zfs\module\zfs\vdev_queue.c @ 796] 0e ffff9f805add3410 fffff80d
76cd8d33 ZFSin!vdev_queue_io_done+0xb5 [C:\BuildAgent\work\88cd52027cd63d70\ZFSin\zfs\module\zfs\vdev_queue.c @ 891] 0f ffff9f805add3440 fffff80d
76cda5b8 ZFSin!zio_vdev_io_done+0x83 [C:\BuildAgent\work\88cd52027cd63d70\ZFSin\zfs\module\zfs\zio.c @ 3593] 10 ffff9f805add3470 fffff80d
76bb06e8 ZFSin!__zio_execute+0x1a8 [C:\BuildAgent\work\88cd52027cd63d70\ZFSin\zfs\module\zfs\zio.c @ 1997] 11 ffff9f805add34e0 fffff801
b97a321d ZFSin!taskq_thread+0x218 [C:\BuildAgent\work\88cd52027cd63d70\ZFSin\spl\module\spl\spl-taskq.c @ 1612] 12 ffff9f805add3550 fffff801
b97fd886 nt!PspSystemThreadStartup+0x41 13 ffff9f805add35a0 00000000
00000000 nt!KiStartSystemThread+0x16 26: kd> .frame 0n10;dv /t /v 0a ffff9f805add3230 fffff80d
76bfd561 ZFSin!abd_iterate_func2+0xf8 [C:\BuildAgent\work\88cd52027cd63d70\ZFSin\zfs\module\zfs\abd.c @ 1212]