issues
search
pentium3
/
sys_reading
system paper reading notes
235
stars
12
forks
source link
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness
#302
Open
pentium3
opened
1 year ago
pentium3
commented
1 year ago
https://github.com/Dao-AILab/flash-attention
https://github.com/Dao-AILab/flash-attention