triton-inference-server / tensorrtllm_backend

The Triton TensorRT-LLM Backend
Apache License 2.0

When to expect NGC container for v0.9.0 like 24.0x-trtllm-python-py3 #430

Closed: ekarmazin closed this issue 2 months ago

ekarmazin commented 2 months ago

Hello, when should we expect the NGC container with TensorRT-LLM and the backend for v0.9.0 to be released?

ekarmazin commented 2 months ago

Thanks @byshiue, the NGC container is now available: nvcr.io/nvidia/tritonserver:24.04-trtllm-python-py3