Closed 0xSSoul closed 5 years ago
cuckoo/mean.cu cuckaroo/mean.cu cuckatoo/mean.cu
cudaMemcpy(hostA, indexesE, NX * NY * sizeof(u32), cudaMemcpyDeviceToHost);
should be change to
cudaMemcpy(hostA, indexesE, sizeof(u32), cudaMemcpyDeviceToHost);
to reduce io between device and host a little?
Well spotted! This line dates back from before we added the Tail kernel for edge compaction. Will fix in next update... Thanks!
cuckoo/mean.cu cuckaroo/mean.cu cuckatoo/mean.cu
should be change to
to reduce io between device and host a little?