[LLVMGPU] Generalize AMDGPUChainedMatmul pass to multiple dimensions

iree-org / iree

A retargetable MLIR-based machine learning compiler and runtime toolkit.

http://iree.dev/

Apache License 2.0

2.47k stars 548 forks source link

Closed Groverkss closed 1 week ago

Groverkss commented 1 week ago

This patch generalizes the AMDGPUChainedMatmul pass to use VectorContractOpInfo to query and transpose dims, instead of hardcoding indexing maps.

Groverkss commented 1 week ago

Since this introduces a n, m matmul, I need to check if attention still works with this patch.

Groverkss commented 1 week ago

Actually, CI seems to be passing on this, so I'm going to land this