microsoft / onnxruntime-extensions

onnxruntime-extensions: A specialized pre- and post-processing library for ONNX Runtime
MIT License
323 stars 84 forks

[WIP] Implement GroupQueryAttention from ORT #654

Open jslhcl opened 7 months ago

jslhcl commented 7 months ago

This PR adds support for the GroupQueryAttention op from ORT.