triton-inference-server / tensorrtllm_backend

The Triton TensorRT-LLM Backend
Apache License 2.0
711 stars 108 forks source link

TensorRT-LLM backend v0.13 Update #607

Closed Shixiaowei02 closed 1 month ago