Package for writing high-level code for parallel high-performance stencil computations that can be deployed on both GPUs and CPUs
BSD 3-Clause "New" or "Revised" License
301
stars
31
forks
source link
sync issues on AMDGPU backend #108
Closed
luraess closed 10 months ago
Running large kernels on AMDGPU backend seem to experience issues with syncing. The behaviour is similar to https://github.com/omlins/ParallelStencil.jl/pull/107#discussion_r1271285037.
The issue is that
AMDGPU.HIPStream
constructor is used instead of getting the default stream for the current taskAMDGPU.stream()
.