SKKU-ESLAB / ANT

ANT (AI-based Networked Things) Framework
Apache License 2.0
26 stars 28 forks source link

SpMM optimization - execution time #85

Closed GLeeGend closed 4 years ago

GLeeGend commented 5 years ago

ACL_GEMM (1024 x 1024) execution time : 10ms CSR_FLEX_GEMM (1024 x 1024) execution time : 60ms (best performance)

The difference between these two performances seems to be the tiling optimization technique.   To get better performance than ACL_GEMM, consider about tile-based SpMM or use other sparse format.

rhkdqo93 commented 4 years ago

Apply 2D group-level pruning can handle this issue. We do not consider the above issues. Closed.