Hello, I am a big fan of this library.
While using CUDAArray, I ran into an error caused by an attempt to allocate memory of size zero. After debugging, I found that it originates from a call to the function "cuda_partition":
https://github.com/mitsuba-renderer/enoki/blob/2a18afa2402e0677887c8439fa3d6a270ea15726/src/cuda/horiz.cu#L81-L103
I think line 99 should be changed to match line 86, as follows:
cuda_check(cudaMemcpyAsync(counts_h, counts, (num_runs_h+1) * sizeof(uint32_t), cudaMemcpyDeviceToHost));
Or will this be fixed in the next release?