c3sr / comm_scope

NUMA-aware multi-CPU multi-GPU data transfer benchmarks
https://github.com/c3sr/scope
Apache License 2.0
21 stars 3 forks source link

Investigate using cudaLaunchHostFunc for getting wall time when a stream operation ends #28

Open cwpearson opened 5 years ago