pentium3 / sys_reading

system paper reading notes
235 stars 12 forks source link

FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness #302

Open pentium3 opened 1 year ago

pentium3 commented 1 year ago

https://github.com/Dao-AILab/flash-attention