microsoft / onnxruntime-extensions

onnxruntime-extensions: A specialized pre- and post-processing library for ONNX Runtime
MIT License
295 stars, 80 forks

refactor ORT-Extension for the coming GroupQueryAttention work #674

Closed jslhcl closed 3 months ago

jslhcl commented 3 months ago

Refactor ORT-Extension for the coming GroupQueryAttention work (https://github.com/microsoft/onnxruntime-extensions/pull/654). Main changes: