apache / mxnet

Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, JavaScript and more
https://mxnet.apache.org
Apache License 2.0

Can we implement FlashAttention-2 in MXNet? #21222

Open rajveer43 opened 1 year ago

rajveer43 commented 1 year ago

Description

FlashAttention-2 is a library that provides attention kernels for faster and more memory-efficient inference and training.
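For context, the memory savings come from computing softmax attention block-by-block with an online (streaming) softmax, so the full seq×seq score matrix is never materialized. Below is a minimal NumPy sketch of that recurrence; it is only a numerical illustration of the algorithm FlashAttention-2 fuses into CUDA kernels, not MXNet or flash-attn API, and the `tiled_attention` name and `block=64` size are illustrative assumptions.

```python
import numpy as np

def tiled_attention(q, k, v, block=64):
    """softmax(q @ k.T / sqrt(d)) @ v, computed one key/value block at a
    time so the full (seq_q, seq_k) score matrix is never materialized."""
    seq_q, d = q.shape
    scale = 1.0 / np.sqrt(d)
    out = np.zeros_like(q)                      # running weighted sum of V
    row_max = np.full(seq_q, -np.inf)           # running max of the logits
    row_sum = np.zeros(seq_q)                   # running softmax denominator
    for start in range(0, k.shape[0], block):
        kb = k[start:start + block]             # (block, d) key tile
        vb = v[start:start + block]             # (block, d) value tile
        s = (q @ kb.T) * scale                  # (seq_q, block) partial logits
        new_max = np.maximum(row_max, s.max(axis=1))
        correction = np.exp(row_max - new_max)  # rescale earlier partials
        p = np.exp(s - new_max[:, None])
        row_sum = row_sum * correction + p.sum(axis=1)
        out = out * correction[:, None] + p @ vb
        row_max = new_max
    return out / row_sum[:, None]

# Sanity check against the naive quadratic-memory implementation.
rng = np.random.default_rng(0)
q, k, v = (rng.standard_normal((128, 32)) for _ in range(3))
s = q @ k.T / np.sqrt(32)
naive = np.exp(s - s.max(axis=1, keepdims=True))
naive = (naive / naive.sum(axis=1, keepdims=True)) @ v
assert np.allclose(tiled_attention(q, k, v), naive, atol=1e-6)
```

An MXNet integration would implement this same recurrence as a fused GPU kernel behind an attention operator, which is where the speed and memory gains over the naive version come from.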

References

- Tri Dao, "FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning," 2023. arXiv:2307.08691
- https://github.com/Dao-AILab/flash-attention

github-actions[bot] commented 1 year ago

Welcome to Apache MXNet (incubating)! We are on a mission to democratize AI, and we are glad that you are contributing to it by opening this issue. Please make sure to include all the relevant context, and one of the @apache/mxnet-committers will be here shortly. If you are interested in contributing to our project, let us know! Also, be sure to check out our guide on contributing to MXNet and our development guides wiki.