Package for writing high-level code for parallel high-performance stencil computations that can be deployed on both GPUs and CPUs
BSD 3-Clause "New" or "Revised" License
Add support for explicit use of selected low level kernel language constructs in `@parallel_indices` kernels #43
Closed
omlins closed 2 years ago
Concretely, it adds support for:
Advanced
- `@gridDim`
- `@blockIdx`
- `@blockDim`
- `@threadIdx`
- `@sync_threads`
- `@sharedMem`
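
As a rough illustration of how the constructs listed above might be combined, the following sketch stages a 2-D array through shared memory inside a `@parallel_indices` kernel. It is not taken from this pull request. The kernel name `copy_via_shmem!`, the array sizes, the thread/block configuration, and the exact launch syntax (in particular the `shmem=` keyword) are assumptions and should be checked against the installed ParallelStencil version. The sketch also assumes a CUDA-capable GPU and array sizes that are exact multiples of the block size, so that every thread reaches `@sync_threads()`.

```julia
using ParallelStencil
using CUDA   # assumption: the backend package is loaded before initialization

@init_parallel_stencil(CUDA, Float64, 2)

# Hypothetical kernel: copy A to B through a per-block shared-memory tile.
@parallel_indices (ix,iy) function copy_via_shmem!(B, A)
    tx  = @threadIdx().x                      # thread index within the block (1-based)
    ty  = @threadIdx().y
    A_l = @sharedMem(eltype(A), (@blockDim().x, @blockDim().y))  # one tile per block
    A_l[tx,ty] = A[ix,iy]                     # stage the value in shared memory
    @sync_threads()                           # wait until the whole tile is filled
    B[ix,iy] = A_l[tx,ty]                     # write the staged value back out
    return
end

nx, ny   = 64, 64                             # assumed sizes, multiples of the block size
A        = @rand(nx, ny)
B        = @zeros(nx, ny)
nthreads = (16, 16, 1)
nblocks  = (nx ÷ nthreads[1], ny ÷ nthreads[2], 1)

# Assumed launch form: explicit blocks and threads plus the dynamic shared-memory size in bytes.
@parallel nblocks nthreads shmem=prod(nthreads)*sizeof(Float64) copy_via_shmem!(B, A)
```

The global indices `ix` and `iy` are still provided by `@parallel_indices`; the low-level macros only add access to the block-local view of the computation. `@gridDim` and `@blockIdx` are not needed in this sketch, but would be used the same way, for example `@blockIdx().x`.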