codeplaysoftware / portBLAS

An implementation of BLAS using the SYCL open standard.
Apache License 2.0

Change usage of DEFAULT_CPU TUNING_TARGET to DEFAULT #515

Closed s-Nick closed 4 months ago

s-Nick commented 4 months ago

This PR changes DEFAULT_CPU from a CPU-specific TUNING_TARGET to a general DEFAULT one. Using this target allows portBLAS to run correctly on any hardware, at the expense of performance.

To use the DEFAULT target on AMD or NVIDIA GPUs, users must specify DPCPP_SYCL_TARGET and DPCPP_SYCL_ARCH; otherwise the compiler cannot offload the kernels.
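As a rough sketch of what such a configuration might look like, the snippet below prints (without running) a CMake configure line for each vendor. The option names TUNING_TARGET, DPCPP_SYCL_TARGET and DPCPP_SYCL_ARCH come from this PR; the `configure_cmd` helper is hypothetical, and the target triples, the architecture values (`sm_80`, `gfx90a`) and the `-DSYCL_COMPILER=dpcpp` option are assumptions that should be checked against the portBLAS README and your own hardware.

```shell
#!/usr/bin/env bash
# Hypothetical helper: prints a CMake configure line for the DEFAULT
# tuning target. It only echoes the command; it does not invoke cmake.
configure_cmd() {
  local sycl_target="$1"  # assumed triple, e.g. nvptx64-nvidia-cuda
  local sycl_arch="$2"    # assumed device arch, e.g. sm_80 or gfx90a
  echo "cmake .. -DSYCL_COMPILER=dpcpp -DTUNING_TARGET=DEFAULT -DDPCPP_SYCL_TARGET=${sycl_target} -DDPCPP_SYCL_ARCH=${sycl_arch}"
}

# NVIDIA GPU (architecture value is an example only)
configure_cmd nvptx64-nvidia-cuda sm_80

# AMD GPU (architecture value is an example only)
configure_cmd amdgcn-amd-amdhsa gfx90a
```

Copy a printed line into a build directory to try it; the architecture must match the installed GPU.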

Right now, some operators don't work correctly, so the corresponding tests are disabled for that target.

Tests disabled on AMD GPUs:

Tests disabled on NVIDIA GPUs: