omlins / ParallelStencil.jl

Package for writing high-level code for parallel high-performance stencil computations that can be deployed on both GPUs and CPUs
BSD 3-Clause "New" or "Revised" License

Fix AMDGPU shared memory allocation #93

Closed omlins closed 1 year ago

omlins commented 1 year ago

Correct shared memory allocation on AMDGPU is now possible after the fix in AMDGPU.jl: https://github.com/JuliaGPU/AMDGPU.jl

Also includes a workaround for AMDGPU.rand not yet supporting Enums: https://github.com/JuliaGPU/AMDGPU.jl/issues/429