ROCm / flash-attention

Fast and memory-efficient exact attention
BSD 3-Clause "New" or "Revised" License
141 stars 46 forks source link

Enable MQA/GQA in backward #100

Closed micmelesse closed 1 week ago

micmelesse commented 1 week ago

enable mqa/gqa in backward