If #[] stops returning a view of the NArray, we can remove NDF_INDEX_LOOP and all of the code that uses idx internally, which makes the implementation much simpler.
This is feedback from Cumo.
In Cumo, the idx C array must be transferred from the CPU to the GPU whenever we want to use it in a CUDA kernel.
It must also be copied back from the GPU to the CPU whenever we want to use it on the CPU side.
This made the implementation of Cumo very complex, so I want to stop using idx.
In Numo, advanced indexing returns a view of a Numo NArray.
In NumPy, advanced indexing returns a new ndarray.
I propose that Numo behave the same way as NumPy.
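
To make the difference concrete, here is a minimal sketch in plain Ruby. It is a toy model only: `IndexView` and `index_copy` are hypothetical names invented for illustration and are not part of Numo or Cumo. It shows that a view has to keep the index list alive and route every access through it, while a copy gathers the elements once and then no longer needs the index list.

```ruby
# Toy model only: IndexView and index_copy are hypothetical,
# not Numo's actual implementation.

# View semantics: keep a reference to the base data plus an index list (idx).
class IndexView
  def initialize(base, idx)
    @base = base
    @idx = idx
  end

  def [](i)
    @base[@idx[i]]
  end

  def []=(i, value)
    @base[@idx[i]] = value # writes go through to the base array
  end

  def to_a
    @idx.map { |j| @base[j] }
  end
end

# Copy semantics: gather the selected elements once; idx is not kept afterwards.
def index_copy(base, idx)
  base.values_at(*idx)
end

base = [10, 20, 30, 40]

view = IndexView.new(base, [0, 2])
view[0] = 99                # also changes base[0]
view_result = view.to_a
base_after_view = base.dup

copy = index_copy(base, [1, 3])
copy[0] = -1                # base is not affected
base_after_copy = base.dup
```

Under copy semantics nothing downstream has to know about idx, which is the simplification this proposal is after. The trade-off is that writing through the result of #[] no longer modifies the original array.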