Closed newling closed 12 hours ago
Now prints performance numbers for shapes
(M, N, K) = (512, 512, 4096) and (512, 4096, 512) and (4096, 512, 512)
with and without ukernels. Numbers with ukernels are interesting, take a look in CI, different gain with ukernel for the different shapes.
But looks like there's some improvement we can make on the vectorization of matmul front.
Any idea of the windows failure?
Yes I'm 90% sure it'll be fixed after rebasing on https://github.com/nod-ai/iree-amd-aie/pull/911 (windows + ukernel = doesn't work)
Now prints performance numbers for shapes
(M, N, K) = (512, 512, 4096) and (512, 4096, 512) and (4096, 512, 512)
with and without ukernels. Numbers with ukernels are interesting, take a look in CI, different gain with ukernel for the different shapes.
But looks like there's some improvement we can make on the vectorization of matmul front.