sivagnanamn closed 6 years ago
We looked intensively at embedded GPUs two years ago and were unable to get any decent performance via OpenCL, even with our extensive autotuning framework. I claim that this GPU is simply too old and that you will see better performance with more recent generations.
Below is the benchmark of ViennaCL on an Adreno 330 GPU (OpenCL 1.1 embedded profile):
Performance seems to be far from optimal. How can I tune ViennaCL GEMM for this hardware?