Open reid3333 opened 1 year ago
Same DELL 4700U 512MB No bios update support.
I think I might be having this same issue when using lshqqytiger/stable-diffusion-webui-directml. Same issue as referenced above: https://github.com/lshqqytiger/stable-diffusion-webui-directml/issues/6
Quick copy paste: The BSOD is a bugcheck code 370 (VIDEO_MEMORY_MANAGEMENT_INTERNAL) with a parameter 0x36 which according to microsoft means "The paging request failed on a paging packet or device resume that was previously marked as unrecoverable, and was expected to succeed subsequent calls." (see: https://learn.microsoft.com/en-us/windows-hardware/drivers/debugger/bug-check-0x10e---video-memory-management-internal ). No idea what it means.
Windows 10, AMD 5700G with pretty recent drivers.
Looking in Windows Task Manager, shared GPU memory usage goes up to around 7-8 GB or so (might be remembering wrong since it happens fast) out of the 16GB that are shown as max, and then I get the blue screen.
I believe my BSoD is related, I'm having the same stop code as @TomArrow:
Quick copy paste: The BSOD is a bugcheck code 370 (VIDEO_MEMORY_MANAGEMENT_INTERNAL) with a parameter 0x36 which according to microsoft means "The paging request failed on a paging packet or device resume that was previously marked as unrecoverable, and was expected to succeed subsequent calls." (see: https://learn.microsoft.com/en-us/windows-hardware/drivers/debugger/bug-check-0x10e---video-memory-management-internal ).
My GPU is a Vega FE, so 16 GB of VRAM shouldn't be having any issues with low memory but I can't even complete a 512² diffusion with --medvram
and no XL will run period even with --lowvram
.
What information is useful? I have some sense that this is a driver level issue, something in perhaps ADL or something to do with resizeable BAR. Then again, when isn't the issue a driver…
When I don't BSoD I get other memory related errors and have to close and reopen the console, restarting the UI doesn't do anything. I've been banging my head against the wall on this for a few days now, I'm not sure what to try next.
BSOD occurs when creating many small tensor in Ryzen iGPU environment and dedicated GPU memory is small.
Code
Note: The above code is designed to execute all test cases sequentially for ease of viewing, but each test case actually runs individually.
Testing Environments 1
Testing Environments 2
Result
All test case results are the same in both test environments.
device=cpu
All test cases are works.
device=directml and small Dedicated GPU Memory (512MB)
device=directml and large Dedicated GPU Memory (2048MB)
All test cases are works.
The same result was obtained in an environment with twice the total amount of memory, so I do not think that lack of memory is the cause.
BSOD report from NirSoft BlueScreenView
The error corresponding to bug check code
10e
isVIDEO_MEMORY_MANAGEMENT_INTERNAL
https://learn.microsoft.com/ja-jp/windows-hardware/drivers/debugger/bug-check-0x10e---video-memory-management-internalSupplementary Information