c3sr / comm_scope

NUMA-aware multi-CPU multi-GPU data transfer benchmarks
https://github.com/c3sr/scope
Apache License 2.0
21 stars 3 forks source link

prefetch-duplex GPU/GPU may be able to associate both streams with a single device #22

Open cwpearson opened 5 years ago

cwpearson commented 5 years ago

If so, we would not need to measure the cost of cudaStreamSynchronize()