yukarinoki / reseach

0 stars 0 forks source link

FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning #34

Open yukarinoki opened 1 year ago

yukarinoki commented 1 year ago

https://arxiv.org/abs/2307.08691

yukarinoki commented 1 year ago

性能比較

https://github.com/yukarinoki/flash-attention#h100

以下はH100での性能グラフ image