In the comments re issue #641 , I sketch out an approach to unifying the copy functions involving optional_ref<const stream_t> for the same wrapper API call usable both for async and non-async variants of CUDA's own APIs. This is also applicable to other operations in the cuda::memory::async namespace: set(), zero(), allocate(), free(). If we adopt that approach for copying, let's also have it for these operations.
In the comments re issue #641 , I sketch out an approach to unifying the copy functions involving
optional_ref<const stream_t>
for the same wrapper API call usable both for async and non-async variants of CUDA's own APIs. This is also applicable to other operations in thecuda::memory::async
namespace:set()
,zero()
,allocate()
,free()
. If we adopt that approach for copying, let's also have it for these operations.