Why preempt reset the GPU need to execute `be_stream_device_queue_cap` times?

flyflypeng commented 1 year ago

https://github.com/SJTU-IPADS/reef/blob/0a25de5d60edaef524752a921a8c72e131137879/src/reef/server/scheduler.cpp#L332

I found the GPUResetCU() function actually call ioctl kfd_ioctl_wave_reset_args function to reset all the weavfronts, so why this place need to reset be_stream_device_queue_cap times?

francis0407 commented 1 year ago

This is just a small "optimization" to make sure the running kernels are killed.

Consider a naive implementation of the preemption:

reset host queue
set preemption flag (using preemption stream)
wait for preemption stream
reset the CUs (after this step, all the kernel will detect the preemption flag as "true")
wait for the execution streams if necessary

Actually, the third step "wait for preemption stream" may take several microseconds. To overlap this operation with other steps, we exchange the third and fourth steps. See: https://github.com/SJTU-IPADS/reef/blob/0a25de5d60edaef524752a921a8c72e131137879/src/reef/server/scheduler.cpp#L335

However, this may lead to another problem: the preemption flag might not have been set to "true" when the running kernel is killed. Therefore, to make sure the all the kernels are either killed or evicted by the preemption flag, we repeat the "ResetCU" call several times here.

flyflypeng commented 1 year ago

Thanks for your detailed explanation! But I still have a question, who checks the preempt_flag of best effort task? the amdgpu driver or HIP runtime?

preempt_flag is passed as one of the kernel parameters, but I can't find the place to process this flag. https://github.com/SJTU-IPADS/reef/blob/0a25de5d60edaef524752a921a8c72e131137879/src/reef/executor/hybrid_executor.cpp#L83

francis0407 commented 1 year ago

The preemption flag is checked in the device code (i.e., GPU kernels).

https://github.com/SJTU-IPADS/reef/blob/0a25de5d60edaef524752a921a8c72e131137879/resource/mocked_kernel/mocked_kernel.be.cu#L30

flyflypeng commented 1 year ago

Thanks a lot!

SJTU-IPADS / reef

Why preempt reset the GPU need to execute `be_stream_device_queue_cap` times? #2