microsoft / onnxruntime-extensions

onnxruntime-extensions: A specialized pre- and post- processing library for ONNX Runtime
MIT License
295 stars 80 forks source link

Introduce flash attention and cutlass library #708

Closed jslhcl closed 1 month ago

jslhcl commented 2 months ago

Introduce flash attention and cutlass library