triton-inference-server / tensorrtllm_backend

The Triton TensorRT-LLM Backend
Apache License 2.0
664 stars 96 forks source link

Update TensorRT-LLM backend #384

Closed Shixiaowei02 closed 6 months ago