omlins / ParallelStencil.jl

Package for writing high-level code for parallel high-performance stencil computations that can be deployed on both GPUs and CPUs
BSD 3-Clause "New" or "Revised" License
311 stars 31 forks source link

Add initial implementation of loop based optimizations #66

Closed omlins closed 1 year ago

omlins commented 1 year ago

Generalization to more cases will needed to be done in future PRs and heuristics and more need to be refined. Thus, the new features do not yet enter the official public API.