Closed gavrissh closed 9 months ago
@yuzisun Wanted to follow up, if the current state of changes are alright?
I have updated with all the recent discussed changes
We should probably add the option to return log probabilities in the result. This seems to be fairly common among other APIs. This would comprise a boolean logprobs
parameter in the request and a corresponding logprobs
property in the response containing an array of objects with keys token
and logprob
.
We should probably add the option to return log probabilities in the result. This seems to be fairly common among other APIs. This would comprise a boolean
logprobs
parameter in the request and a correspondinglogprobs
property in the response containing an array of objects with keystoken
andlogprob
.
I have updated the PR to support the above items
Thanks @gavrishp !! Great job on getting this going with the initial version.
/lgtm /approve
[APPROVALNOTIFIER] This PR is APPROVED
This pull-request has been approved by: gavrishp, yuzisun
The full list of commands accepted by this bot can be found here.
The pull request process is described here
Propose generate rest api endpoints
[
](url)
Reference - https://docs.nvidia.com/deeplearning/triton-inference-server/user-guide/docs/protocol/extension_generate.html#generate-extension