daigo0927 / blog


Compose ONNX operation with ORT extensions #75

Closed by daigo0927 2 years ago

daigo0927 commented 2 years ago

When serving ML predictions, it is sometimes useful to bundle the pre- and post-processing steps together with the model, so that the whole pipeline ships as a single ONNX graph. ONNX Runtime Extensions lets us compose ONNX operations flexibly, as in this tutorial: https://github.com/microsoft/onnxruntime-extensions/blob/main/tutorials/gpt2bs.py#L95 .