projectNe10 / Ne10

An open optimized software library project for the ARM® Architecture
Other
1.46k stars 408 forks source link

sgemm on Ne10 #200

Open SingleDavid opened 6 years ago

SingleDavid commented 6 years ago

Only 22, 33, 4*4 sgemm are supported on Ne10, what the other sizes of matmul?

Phillip-Wang commented 6 years ago

The sgemm is an important function in BLAS. Ne10 does not provide it but there are some handy functions that can be combined into a sgemm.

deeptibhegde commented 5 years ago

Can you elaborate on those other functions to implement sgemm? Is mat_mul the only sgemm equivalent?