Closed technillogue closed 4 months ago
triton 0.10.0 has a breaking change. this PR tries to support it.
https://github.com/NVIDIA/TensorRT-LLM/releases/tag/v0.10.0#:~:text=The%20input%20prompt%20was%20removed%20from%20the%20generation%20output
triton 0.10.0 has a breaking change. this PR tries to support it.
https://github.com/NVIDIA/TensorRT-LLM/releases/tag/v0.10.0#:~:text=The%20input%20prompt%20was%20removed%20from%20the%20generation%20output