bkloppenborg / liboi

OpenCL Interferometry Library
https://github.com/bkloppenborg/liboi/wiki
GNU Lesser General Public License v3.0
5 stars 6 forks source link

normalize_float.cl speedup #21

Closed bkloppenborg closed 10 years ago

bkloppenborg commented 11 years ago

The normalize_float.cl kernel could be accelerated by computing the 1/x and storing it into local/global memory. Right now this kernel achieves ~60% occupancy and accounts for < 0.1% of GPU time. Therefore this is quite low in terms of priority.

bkloppenborg commented 10 years ago

Closed in 89c82c0.