codeplaysoftware / portBLAS

An implementation of BLAS using the SYCL open standard.
Apache License 2.0
250 stars 48 forks source link

[Draft] Added joint_matrix for Intel PVC and ARC #507

Open muhammad-tanvir-1211 opened 5 months ago

muhammad-tanvir-1211 commented 5 months ago

Only works for trans_a = n and trans_b = n case. Hasn't been tested for corner cases either.