Behemyth closed 7 years ago
Surface memory is backed by cudaArrays. This memory cannot be modified atomically, and reads are not coherent if the surface has been written to in the same kernel. This means we will have to use global memory with custom locality indexing.
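As a rough sketch of what "custom locality indexing" could look like, one option is a Morton (Z-order) index, which keeps pixels that are close in 2D close together in a linear global-memory buffer. This is only an illustration, not necessarily the scheme the project ended up using; `spreadBits`, `mortonIndex`, and the `HD` macro are hypothetical names, and the buffer is assumed to be padded to a power-of-two square.

```cpp
#include <cstdint>

// Lets the same helpers compile as plain C++ or as CUDA host/device code.
#ifdef __CUDACC__
#define HD __host__ __device__
#else
#define HD
#endif

// Spread the low 16 bits of v so that they occupy the even bit positions.
HD inline std::uint32_t spreadBits(std::uint32_t v) {
    v &= 0x0000FFFFu;
    v = (v | (v << 8)) & 0x00FF00FFu;
    v = (v | (v << 4)) & 0x0F0F0F0Fu;
    v = (v | (v << 2)) & 0x33333333u;
    v = (v | (v << 1)) & 0x55555555u;
    return v;
}

// Interleave x (even bits) and y (odd bits) into a Z-order linear index.
// Assumes x and y each fit in 16 bits and that the backing buffer is
// sized for a power-of-two square, so every index is in range.
HD inline std::uint32_t mortonIndex(std::uint32_t x, std::uint32_t y) {
    return spreadBits(x) | (spreadBits(y) << 1);
}
```

Inside a kernel, a result could then be accumulated with something like `atomicAdd(&results[mortonIndex(x, y)], value)`, where `results` is a hypothetical `float*` in global memory. This keeps the atomic updates that surface memory cannot provide, while recovering some of the 2D locality that a texture layout would otherwise give.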
Modify the RayJob results to reside in texture memory instead of global memory. This should give a performance increase and allow better filtering.