Dear developer:
I noticed all the kernel functions used in HPXCL examples and benckmarks only have one output argument. Can HPXCL accommodate the CUDA kernels with at least 2 output arguments for exchanging with the context kernel functions?
Thanks
Li Jian
Dear developer: I noticed all the kernel functions used in HPXCL examples and benckmarks only have one output argument. Can HPXCL accommodate the CUDA kernels with at least 2 output arguments for exchanging with the context kernel functions? Thanks Li Jian