triton-inference-server / onnxruntime_backend

The Triton backend for the ONNX Runtime.
BSD 3-Clause "New" or "Revised" License

Is onnxruntime-genai supported? #251

Open jackylu0124 opened 4 months ago

jackylu0124 commented 4 months ago

Hey all, I have a quick question: is onnxruntime-genai (https://onnxruntime.ai/docs/genai/api/python.html) supported in Triton Inference Server's ONNX Runtime backend? I couldn't find anything relevant in the documentation. Thanks in advance!

Ben-Epstein commented 4 months ago

I came here for the same question! 😄