krrishnarraj / clpeak

A tool which profiles OpenCL devices to find their peak capacities
Apache License 2.0
396 stars 111 forks source link

Global size now is multiple of local size #42

Closed doe300 closed 5 years ago

doe300 commented 7 years ago

This fixes #41 by rounding the global work-size down to a multiple of the local work-size.

Possible drawback: The value is now guaranteed to be a multiple of the local work-size, not a power of it! As far as I can tell, this should have no negative consequences.