usnistgov / hiperc

High Performance Computing Strategies for Boundary Value Problems
https://pages.nist.gov/hiperc/en/latest/index.html
39 stars 7 forks source link

unroll convolution #134

Open tkphd opened 6 years ago

tkphd commented 6 years ago

Inner loops on CUDA convolution code should run faster using a #pragma unroll statement.