ROCm / flash-attention

Fast and memory-efficient exact attention
BSD 3-Clause "New" or "Revised" License
131 stars 41 forks source link

Feature/seqlenq ngroups swap #62

Closed poyenc closed 3 months ago