devinamatthews / tblis

TBLIS is a library and framework for performing tensor operations, especially tensor contraction, using efficient native algorithms.
BSD 3-Clause "New" or "Revised" License
113 stars 27 forks source link

Is the matrix operation of this library as fast as MKL #34

Open matrix97317 opened 3 years ago

devinamatthews commented 3 years ago

The matrix multiplication primitives are essentially the same as in BLIS; you can find lots of performance graphs for BLIS here. It is typically as fast or faster than OpenBLAS and about 10% slower than MKL. Of course, TBLIS's forte is tensor operations, which are not natively available in MKL and are much slower than TBLIS when implemented using tensor transpose+matrix multiplication (see here).