I have recently been using Triton to deploy my model service and found that Triton only provides an SDK, with no API documentation. To find the API, I have to read the SDK client code, which is very unfriendly. I hope Triton can provide API documentation like vLLM does.
Same here. It would be better if triton-server provided an SDK in addition to the standalone server, so that I could integrate it with other HTTP servers such as FastAPI to serve HTTP requests.