Open jiahaipeng opened 6 years ago
I want to know how the performance of CLBLAS on AMD embeded APU, such as RX-416GD.
I tested clBLAS on AMD RX-416GD, the performance of sgemm only up to 123GFlops. However, the peak performance of RX-416GD(GPU) is 480GFlops.
I want to know why the performance of sgemm is low or some mistakes occurs when I install clblas.
Thanks
clBLAS is not tuned or optimized for embedded APU but for discrete GPU like R9 Fury Nano. I do not think any mistakes here.
I want to know how the performance of CLBLAS on AMD embeded APU, such as RX-416GD.
I tested clBLAS on AMD RX-416GD, the performance of sgemm only up to 123GFlops. However, the peak performance of RX-416GD(GPU) is 480GFlops.
I want to know why the performance of sgemm is low or some mistakes occurs when I install clblas.
Thanks