At least the v2 version of miniBUDE (have not looked at the other one) uses --use_fast_math with CUDA, while not using -ffast-math with SYCL. This causes results to not be comparable between the two. Fast math significantly impacts performance for this app.
Note that this does not impact icpx which uses -ffast-math by default.
At least the v2 version of miniBUDE (have not looked at the other one) uses
--use_fast_math
with CUDA, while not using-ffast-math
with SYCL. This causes results to not be comparable between the two. Fast math significantly impacts performance for this app.Note that this does not impact
icpx
which uses-ffast-math
by default.