Open bhack opened 6 months ago
@z-x-yang Any plan to reformulate one of the available attentions on using the new official pytorch SDPA? https://pytorch.org/blog/pytorch2-2/
I think that we will have a lot of speed-up and resource optimization with the underline flashattentionv2 implementation.
@z-x-yang Any plan to reformulate one of the available attentions on using the new official pytorch SDPA? https://pytorch.org/blog/pytorch2-2/
I think that we will have a lot of speed-up and resource optimization with the underline flashattentionv2 implementation.