Open AaronGhost opened 1 month ago
Good catch! IIRC we introduce these for simplicity of the kernel implementation: https://github.com/JuliaGPU/CUDA.jl/blob/a0aa8b8c142f5eab9db0889802fba9636bdb454b/src/mapreduce.jl#L183-L187
Probably need to reshape them out, or keep the original input around for returning.
Hi, thanks again for putting CUDA.jl together!
I found that the return type of the
sum!
function can be different betweenArray
andCuArray
: theArray
return type is the same as the left argument while theCuArray
return type retains the singleton dimension.I would expect to get the same type as
Y_d
in this case.Version info