Now we assume that the reduction operation for parallel_reduce is "+".
The idea is to use any kind of operation, passing the operation as a parameter.
This has to be implemented in all the backends in JACC.jl/src/JACC.jl (Threads.jl), JACC.jl/ext/JACCCUDA/JACCUDA.jl (CUDA.jl) ...
Now we assume that the reduction operation for parallel_reduce is "+". The idea is to use any kind of operation, passing the operation as a parameter. This has to be implemented in all the backends in JACC.jl/src/JACC.jl (Threads.jl), JACC.jl/ext/JACCCUDA/JACCUDA.jl (CUDA.jl) ...