ROCm / flash-attention

Fast and memory-efficient exact attention
BSD 3-Clause "New" or "Revised" License
141 stars 46 forks source link

Ck tile/flash attention #61

Closed rocking5566 closed 5 months ago

rocking5566 commented 5 months ago

Integrate batch and group mode FA fwd.