KhronosGroup / Vulkan-Samples

One stop solution for all Vulkan samples
Apache License 2.0

Updated Timeline Semaphore sample #1125

Closed bryce-young-mobica closed 1 week ago

bryce-young-mobica commented 1 month ago

Description

Reworked the timeline semaphore sample to prevent it crashing on Windows.

The only obvious trigger for the crash that I could observe was the main thread calling "vkDeviceWaitIdle" whilst the compute thread was in "wait_timeline_gpu". To avoid this, I removed the "wait/signal_timeline_gpu" calls (opting to attach the "VkTimelineSemaphoreSubmitInfo" to the queue submissions instead), and restructured the compute/graphics work stages to prevent the compute thread from running ahead and submitting (potentially blocking) work.
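The ordering described above, where each submission carries its own wait and signal values instead of the host calling separate wait/signal helpers, can be pictured with a small CPU-side model. This is a sketch under stated assumptions, not code from the sample and not the Vulkan API: `Submission`, `run_queues`, the queue builders, and the two-values-per-frame numbering are all hypothetical. In real Vulkan code the values would be supplied through a `VkTimelineSemaphoreSubmitInfo` chained to `VkSubmitInfo::pNext`.

```cpp
#include <cassert>
#include <cstdint>
#include <deque>
#include <initializer_list>
#include <string>
#include <vector>

// One queue submission, reduced to the timeline values it carries.
struct Submission {
    std::string name;
    uint64_t wait_value;    // may start once the timeline counter is >= this
    uint64_t signal_value;  // counter value once the work has completed
};

// Hypothetical numbering: each frame owns two consecutive timeline values.
inline uint64_t compute_done(uint64_t frame) { return 2 * frame + 1; }
inline uint64_t draw_done(uint64_t frame) { return 2 * frame + 2; }

// Compute work for frame N waits for the draw of frame N-1 (value 2N),
// so it cannot run ahead of the graphics work.
std::deque<Submission> make_compute_queue(uint64_t frames) {
    std::deque<Submission> queue;
    for (uint64_t f = 0; f < frames; ++f)
        queue.push_back({"compute " + std::to_string(f), 2 * f, compute_done(f)});
    return queue;
}

// Graphics work for frame N waits for the compute work of the same frame.
std::deque<Submission> make_graphics_queue(uint64_t frames) {
    std::deque<Submission> queue;
    for (uint64_t f = 0; f < frames; ++f)
        queue.push_back({"draw " + std::to_string(f), compute_done(f), draw_done(f)});
    return queue;
}

// Each queue is in-order: only its head may execute, and only when the
// timeline counter has reached the head's wait value.
std::vector<std::string> run_queues(std::deque<Submission> compute,
                                    std::deque<Submission> graphics) {
    uint64_t counter = 0;
    std::vector<std::string> order;
    bool progressed = true;
    while (progressed) {
        progressed = false;
        for (auto* queue : {&compute, &graphics}) {
            if (queue->empty() || counter < queue->front().wait_value)
                continue;
            const Submission& head = queue->front();
            assert(head.signal_value > counter);  // timeline values only increase
            order.push_back(head.name);
            counter = head.signal_value;
            queue->pop_front();
            progressed = true;
        }
    }
    return order;  // anything left in a queue never had its wait satisfied
}
```

Calling `run_queues(make_compute_queue(3), make_graphics_queue(3))` in this model interleaves the two queues strictly as compute, draw, compute, draw, which is the property the restructured stages are meant to guarantee.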

Fixes #588

Tested on Windows, using both separate and shared queues for compute and graphics.


bryce-young-mobica commented 1 month ago

> But I get an error about every second time I start this demo:
> Validation Error: [ UNASSIGNED-CoreValidation-DrawState-InvalidImageLayout ] Object 0: handle = 0x243ed7ef060, type = VK_OBJECT_TYPE_COMMAND_BUFFER; Object 1: handle = 0x944a2c0000000039, type = VK_OBJECT_TYPE_IMAGE; | MessageID = 0x4dae5635 | vkQueueSubmit(): pSubmits[0].pCommandBuffers[0] command buffer VkCommandBuffer 0x243ed7ef060[] expects VkImage 0x944a2c0000000039[] (subresource: aspectMask 0x1 array layer 0, mip level 0) to be in layout VK_IMAGE_LAYOUT_SHADER_READ_ONLY_OPTIMAL--instead, current layout is VK_IMAGE_LAYOUT_UNDEFINED.

I saw that error during development (iirc it was related to the compute work starting before the setup had finished), and I resolved it by moving the initial "game of life" setup out of the compute thread.

How are you running the tests? (I've been unable to reproduce it myself using the VS debugger or Vulkan Configurator)

SaschaWillems commented 1 month ago

I can reproduce that message, but only randomly. I ran the sample many times (latest commit) and only saw that validation error twice. It's triggered from here:

[image]

Since it's so random (sometimes takes more than 10 runs to show up) it's pretty hard to debug.

P.S.: I'm running a debug build on Win11 with VS 2022 and using the latest SDK.

asuessenbach commented 1 month ago

I can repro the validation layer error on about every second run, on Win10, VS2022, NVIDIA RTX A3000 Laptop GPU.

It's always shared.images[0], created on line 122. If I wait_on_timeline(Timeline::draw); in do_graphics_work() when timeline.frame == 0, the issue seems to disappear. But I have no idea what synchronization is missing or failing to make that a requirement.

bryce-young-mobica commented 1 month ago

> It's always shared.images[0], created on line 122. If I wait_on_timeline(Timeline::draw); in do_graphics_work() when timeline.frame == 0, the issue seems to disappear. But I have no idea what synchronization is missing or failing to make that a requirement.

Thanks, I think that is fixing the issue by ensuring the compute commands are submitted first (image[1] is initialised in the "setup_game_of_life" function, but image[0] was only being initialised on that first submission). I've updated the "setup_game_of_life" function to run both VkImages through the compute "init_pipeline".
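The reasoning in the paragraph above can be illustrated with a toy model of per-image layout tracking. It is only a sketch under assumptions: the `Layout` enum, `kImageCount`, and every function name here are hypothetical stand-ins, and the real sample's layouts and transitions are more involved. The model only captures the claim that an image is usable by the graphics pass once something has initialised it.

```cpp
#include <array>
#include <cassert>
#include <cstddef>

// Simplified stand-in for the layout state the validation layer tracks.
enum class Layout { Undefined, ShaderReadOnly };

constexpr std::size_t kImageCount = 2;  // assumed image count for this sketch
using Images = std::array<Layout, kImageCount>;

// Setup as described before the fix: only image[1] is initialised.
Images setup_partial() {
    Images images;
    images.fill(Layout::Undefined);
    images[1] = Layout::ShaderReadOnly;
    return images;
}

// Setup after the fix: every image is run through initialisation.
Images setup_all() {
    Images images;
    images.fill(Layout::ShaderReadOnly);
    return images;
}

// A compute submission leaves its target image readable by the graphics pass.
void compute_submit(Images& images, std::size_t index) {
    assert(index < kImageCount);
    images[index] = Layout::ShaderReadOnly;
}

// A graphics submission that samples an image; returns false where the
// validation layer would report an unexpected layout.
bool graphics_submit_ok(const Images& images, std::size_t index) {
    assert(index < kImageCount);
    return images[index] == Layout::ShaderReadOnly;
}
```

With `setup_partial()`, a graphics submission that samples image 0 before any compute submission fails the check, which matches the intermittent error; with `setup_all()`, the check passes regardless of which submission happens first.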

@asuessenbach please could you try the latest patch and let me know if it helps?

asuessenbach commented 1 month ago

Yep, that seems to fix the issue. Note, though, that you could handle all NumSyncFrames images in one submit, like so: 0001-Get-image-initializations-in-one-command-buffer.patch

marty-johnson59 commented 1 week ago

Merging - 3 approvals