xcompact3d / x3d2

https://xcompact3d.github.io/x3d2
BSD 3-Clause "New" or "Revised" License

Carry out buffer copies with a CUDA kernel. #47

Closed: semi-h closed this issue 6 months ago

semi-h commented 6 months ago

For performance, it is better to carry out the buffer copies with a dedicated CUDA kernel than to use the sliced array copies that nvfortran supports.
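For illustration only, here is a minimal sketch of what such a copy kernel might look like. It is written in CUDA C++ rather than the project's CUDA Fortran, and nothing in it is taken from the x3d2 source: the names `copy_slice_kernel` and `copy_slice`, the `(sz, n, n_block)` column-major layout, and the one-thread-per-element launch configuration are all assumptions made for the example. The kernel gathers a range of `n_halo` slices along the middle index into a dense buffer, which is roughly what a sliced assignment such as `buf(:, :, :) = u(:, j_start:j_start + n_halo - 1, :)` would express in Fortran.

```cuda
#include <cuda_runtime.h>

// Hypothetical layout (an assumption, not taken from x3d2):
// u has shape (sz, n, n_block) in column-major order, so element
// (i, j, k) is stored at i + sz * (j + n * k).
// The kernel copies the slices j_start .. j_start + n_halo - 1 into a
// dense buffer of shape (sz, n_halo, n_block).
__global__ void copy_slice_kernel(double *buf, const double *u,
                                  int sz, int n, int n_halo, int j_start)
{
    int i = threadIdx.x;  // fastest-varying index: one thread per element
    int k = blockIdx.x;   // slowest-varying index: one thread block per k
    if (i >= sz) return;  // guard in case blockDim.x is larger than sz

    // Consecutive threads touch consecutive addresses for each j,
    // so both the loads and the stores are coalesced.
    for (int j = 0; j < n_halo; ++j) {
        buf[i + sz * (j + n_halo * k)] = u[i + sz * (j_start + j + n * k)];
    }
}

// Hypothetical host wrapper. d_buf and d_u must be device pointers.
// Assumes sz does not exceed the maximum threads per block (1024 on
// current GPUs). The launch is asynchronous; the returned value only
// reports launch errors.
cudaError_t copy_slice(double *d_buf, const double *d_u,
                       int sz, int n, int n_block, int n_halo, int j_start)
{
    copy_slice_kernel<<<n_block, sz>>>(d_buf, d_u, sz, n, n_halo, j_start);
    return cudaGetLastError();
}
```

Under these assumptions, a send buffer holding the last four slices of a `(32, 16, n_block)` field would be filled with `copy_slice(d_buf, d_u, 32, 16, n_block, 4, 12)`. Whether this actually outperforms the compiler-generated sliced copy would need to be measured on the target hardware.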