oneAPI DPC++ Library (oneDPL) https://software.intel.com/content/www/us/en/develop/tools/oneapi/components/dpc-library.html
Using `fetch_min` / `fetch_max` in `__parallel_find_or` and related code #1650
Closed
SergeyKopienko closed 1 week ago
This PR uses `fetch_min` / `fetch_max` in `__parallel_find_or` and related code. This gives an additional performance boost compared with `for`-based loops with additional checks. The idea was proposed by @danhoeflinger.
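For readers unfamiliar with the pattern, here is a minimal sketch of the kind of loop-with-checks that a single atomic `fetch_min` call can stand in for. It is written against plain `std::atomic` so it compiles without a SYCL toolchain. The names `fetch_min_emulated` and `fetch_min_demo` are hypothetical and are not taken from the oneDPL sources.

```cpp
#include <atomic>
#include <cassert>
#include <cstddef>

// Hypothetical helper (not oneDPL code): emulates an atomic "fetch_min"
// with a compare-exchange loop and an explicit comparison on every retry.
template <typename T>
T fetch_min_emulated(std::atomic<T>& target, T candidate)
{
    T observed = target.load(std::memory_order_relaxed);
    // Keep trying only while the candidate would actually lower the stored value.
    while (candidate < observed &&
           !target.compare_exchange_weak(observed, candidate, std::memory_order_relaxed))
    {
        // On failure compare_exchange_weak refreshes `observed`;
        // the loop condition then re-checks it against the candidate.
    }
    // Like a real fetch_min, return the value seen before any update.
    return observed;
}

// Single-threaded walk-through of the semantics, e.g. tracking the
// smallest index at which a match was found.
inline void fetch_min_demo()
{
    std::atomic<std::size_t> first_match{10};

    // A smaller index replaces the stored one; the old value is returned.
    assert(fetch_min_emulated(first_match, std::size_t{4}) == 10);
    assert(first_match.load() == 4);

    // A larger index leaves the stored value untouched.
    assert(fetch_min_emulated(first_match, std::size_t{7}) == 4);
    assert(first_match.load() == 4);
}
```

With a native atomic minimum (for example `sycl::atomic_ref::fetch_min` in SYCL 2020), the loop and the comparison collapse into one call on the atomic object. `fetch_max` is the mirror image for searches that want the largest index.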
This PR also uses the `__dpl_ceiling_div` function where applicable in the code.
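As an illustration of what a ceiling-division helper is typically used for, the sketch below computes how many fixed-size chunks are needed to cover `n` elements. `ceiling_div` and `chunk_count` are hypothetical stand-ins written for this example. They are not the actual definition of `__dpl_ceiling_div`, whose signature and implementation are not shown in this PR description.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical stand-in for a ceiling-division helper (assumption, not oneDPL code).
// Rounds the quotient up without the overflow risk of (number + divisor - 1).
constexpr std::size_t ceiling_div(std::size_t number, std::size_t divisor)
{
    return number / divisor + (number % divisor != 0 ? 1 : 0);
}

// Hypothetical usage: number of chunks of `chunk_size` needed to cover `n` elements.
inline std::size_t chunk_count(std::size_t n, std::size_t chunk_size)
{
    assert(chunk_size > 0); // division by zero is not meaningful here
    return ceiling_div(n, chunk_size);
}

// Compile-time checks of the rounding behaviour.
static_assert(ceiling_div(1000, 256) == 4, "partial last chunk rounds up");
static_assert(ceiling_div(1024, 256) == 4, "exact multiple does not round up");
static_assert(ceiling_div(0, 256) == 0, "zero elements need zero chunks");
```

Replacing hand-written expressions such as `(n + size - 1) / size` with one named helper keeps the rounding rule in a single place.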