Closed EwoutH closed 5 months ago
I don't have direct experience with Triton-Inference-Server, I'll look into it in the nex days
@EwoutH I think you confused OpenAI Triton (the Language) with Nvidia Triton (a API server in C++)
Right, from the Readme I didn’t figure that. Thanks for clearing that up!
Would it be possible to create a Triton backend from this implementation?