dzhoshkun/cuda-learning
BSD 3-Clause "New" or "Revised" License
Check speed-up with CUDA streams #3
Open
dzhoshkun opened 6 years ago
dzhoshkun commented 6 years ago
Pre-requisite:
[ ] https://devblogs.nvidia.com/how-overlap-data-transfers-cuda-cc/
As a continuation of #1:
[ ] implement a streamed version of the `add` function
[ ] call it with e.g. 22 streams (GTX 980 Ti) and check the speed-up against the other versions
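
A rough sketch of what the streamed version could look like, following the chunked copy/kernel/copy pattern from the pre-requisite post. It is only a starting point under several assumptions: the `add` from #1 is taken to be an element-wise grid-stride kernel computing `y = x + y` (the real one may differ), the host buffers are pinned so the async copies can overlap, and the names `run_streamed_add` and `CUDA_CHECK` as well as the even chunking scheme are hypothetical.

```cuda
#include <algorithm>
#include <chrono>
#include <cmath>
#include <cstdio>
#include <cstdlib>
#include <vector>
#include <cuda_runtime.h>

// Hypothetical error-checking helper, not part of the repository.
#define CUDA_CHECK(call)                                              \
    do {                                                              \
        cudaError_t err_ = (call);                                    \
        if (err_ != cudaSuccess) {                                    \
            std::fprintf(stderr, "CUDA error: %s (%s:%d)\n",          \
                         cudaGetErrorString(err_), __FILE__, __LINE__); \
            std::exit(EXIT_FAILURE);                                  \
        }                                                             \
    } while (0)

// Assumed shape of the add kernel from #1: y[i] = x[i] + y[i].
__global__ void add(int n, const float *x, float *y)
{
    int index = blockIdx.x * blockDim.x + threadIdx.x;
    int stride = blockDim.x * gridDim.x;
    for (int i = index; i < n; i += stride)
        y[i] = x[i] + y[i];
}

// Splits the arrays into one chunk per stream and issues, per stream,
// H2D copies, the kernel, and the D2H copy. Returns the maximum absolute
// error against the expected value 3.0f; optionally reports wall time.
float run_streamed_add(int n, int num_streams, float *elapsed_ms)
{
    const size_t total_bytes = static_cast<size_t>(n) * sizeof(float);
    float *h_x, *h_y, *d_x, *d_y;
    // Pinned host memory is required for copies to be truly asynchronous.
    CUDA_CHECK(cudaMallocHost(&h_x, total_bytes));
    CUDA_CHECK(cudaMallocHost(&h_y, total_bytes));
    CUDA_CHECK(cudaMalloc(&d_x, total_bytes));
    CUDA_CHECK(cudaMalloc(&d_y, total_bytes));
    for (int i = 0; i < n; ++i) { h_x[i] = 1.0f; h_y[i] = 2.0f; }

    std::vector<cudaStream_t> streams(num_streams);
    for (auto &s : streams) CUDA_CHECK(cudaStreamCreate(&s));

    const int chunk = (n + num_streams - 1) / num_streams;
    const int threads = 256;
    auto t0 = std::chrono::steady_clock::now();
    for (int s = 0; s < num_streams; ++s) {
        const int offset = s * chunk;
        if (offset >= n) break;  // more streams than chunks
        const int len = std::min(chunk, n - offset);
        const size_t bytes = static_cast<size_t>(len) * sizeof(float);
        CUDA_CHECK(cudaMemcpyAsync(d_x + offset, h_x + offset, bytes,
                                   cudaMemcpyHostToDevice, streams[s]));
        CUDA_CHECK(cudaMemcpyAsync(d_y + offset, h_y + offset, bytes,
                                   cudaMemcpyHostToDevice, streams[s]));
        const int blocks = (len + threads - 1) / threads;
        add<<<blocks, threads, 0, streams[s]>>>(len, d_x + offset, d_y + offset);
        CUDA_CHECK(cudaGetLastError());
        CUDA_CHECK(cudaMemcpyAsync(h_y + offset, d_y + offset, bytes,
                                   cudaMemcpyDeviceToHost, streams[s]));
    }
    CUDA_CHECK(cudaDeviceSynchronize());  // wait for all streams
    auto t1 = std::chrono::steady_clock::now();
    if (elapsed_ms)
        *elapsed_ms = std::chrono::duration<float, std::milli>(t1 - t0).count();

    float max_error = 0.0f;
    for (int i = 0; i < n; ++i)
        max_error = std::fmax(max_error, std::fabs(h_y[i] - 3.0f));

    for (auto &s : streams) CUDA_CHECK(cudaStreamDestroy(s));
    CUDA_CHECK(cudaFree(d_x));
    CUDA_CHECK(cudaFree(d_y));
    CUDA_CHECK(cudaFreeHost(h_x));
    CUDA_CHECK(cudaFreeHost(h_y));
    return max_error;
}

int main()
{
    float ms = 0.0f;
    float err = run_streamed_add(1 << 20, 22, &ms);  // 22 streams as suggested above
    std::printf("max error: %f, elapsed: %f ms\n", err, ms);
    return err == 0.0f ? 0 : 1;
}
```

Calling `run_streamed_add` with `num_streams = 1` gives a non-overlapped baseline measured the same way, which could then be compared against the 22-stream run and the versions from #1. The timing here is host wall-clock around the issue loop plus `cudaDeviceSynchronize`, so it includes the transfers; whether any overlap actually shows up depends on the device's copy engines and the chunk size.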