oneapi-src / oneDPL

oneAPI DPC++ Library (oneDPL) https://software.intel.com/content/www/us/en/develop/tools/oneapi/components/dpc-library.html
Apache License 2.0
720 stars 114 forks source link

Single WG implementation of `__parallel_find_or` #1736

Closed SergeyKopienko closed 1 month ago

SergeyKopienko commented 1 month ago

In this PR we prepare single work-group implementation of __parallel_find_or :

This approach gives us a big performance boost for small data sizes.

SergeyKopienko commented 1 month ago

@julianmi , @danhoeflinger , @adamfidel have somebody additional comments for this PR? Looks like all comments has been fixed now.