triton-inference-server / tensorrtllm_backend

The Triton TensorRT-LLM Backend
Apache License 2.0
581 stars 81 forks source link

TensorRT-LLM backend v0.10 update #492

Closed kaiyux closed 3 weeks ago