issues
search
ROCm
/
flash-attention
Fast and memory-efficient exact attention
BSD 3-Clause "New" or "Revised" License
141
stars
46
forks
source link
Enable MQA/GQA in backward
#100
Closed
micmelesse
closed
1 week ago
micmelesse
commented
1 week ago
enable mqa/gqa in backward
enable mqa/gqa in backward