pentium3 / sys_reading

system paper reading notes
229 stars 12 forks source link

FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness #302

Open pentium3 opened 8 months ago

pentium3 commented 8 months ago

https://github.com/Dao-AILab/flash-attention