jeffwong / cudaglmnet

A lasso solver on the GPU
11 stars 3 forks source link

copy functions might perform better using coalescing #2

Closed jeffwong closed 11 years ago

jeffwong commented 11 years ago

may be better to use cudaMemCopy and take advantage of memory coalescing instead of using a kernel to copy elemnts one by one