CNugteren / CLBlast

Tuned OpenCL BLAS
Apache License 2.0
1.06k stars 202 forks source link

About the performance in different matrix layouts #553

Closed diverger closed 4 months ago

diverger commented 4 months ago

I'm porting a project based on cuBlas to clblast. As known, the former assumes all matrices are column majored. But the latter has row major as the default layout in the C++ API. Does it means it will perform better with row major matrices?

CNugteren commented 4 months ago

No, not necessarily. This will differ per device. You can run the included 'clients' to do benchmarks if you want.