JuliaORNL / JACC.jl

CPU/GPU parallel performance portable layer in Julia via functions as arguments
MIT License
21 stars 13 forks source link

New optimization for parallel reduce on CUDA, AMDGPU and oneAPI using… #22

Closed pedrovalerolara closed 10 months ago

pedrovalerolara commented 10 months ago

… multiple SMs. Added new test cases for these new implemenations