Maghoumi closed 7 years ago
Hi, thanks! This was the result using CUDA on my system, which has the same specs as listed in the README.
I've pushed the latest commit with CUDA integrated; maybe there's something you'd like to tweak or improve further.
Hey, oh wow, these results are indeed very interesting... Thanks! 👍 I'll take a look! Thanks again : )
First, great PoC! I can't believe nobody had thought of this! 👍 A quick question: I noticed you're handling the communication via OpenCL. I was curious to know whether using CUDA would have improved things overall. I know CUDA supports overlapping operations and tons of optimizations under the hood, and I wanted to know if you ever considered or tried that.
Also, for anybody else who may be curious about the performance on a higher-end GPU, here's a screenshot of the benchmark, which was run on my GTX 1080 card.
Again, great work!