triton-inference-server / tensorrtllm_backend

The Triton TensorRT-LLM Backend
Apache License 2.0

nvcr.io/nvidia/tritonserver:24.06-trtllm-python-py3 missing tritonserver binary? #516

Closed. snc-mana closed this issue 3 months ago.

snc-mana commented 3 months ago

I tried running the latest Docker image of the Triton server with the latest TensorRT-LLM backend, but I cannot find the tritonserver binary, which was present in the previous image version, 24.05-trtllm-python-py3. The new image is also suspiciously small: about 10 GB, versus 18 GB for the previous one.
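For anyone who wants to reproduce the check inside the container, here is a minimal sketch of how one might look for the binary. It is not part of the original report: the helper name `find_binary` is hypothetical, and the directory `/opt/tritonserver/bin` is an assumption about where Triton images usually install the server, so adjust it if your image differs.

```python
import os
import shutil
from typing import Iterable, Optional

# Assumed install location; the report above does not state a path.
ASSUMED_DIRS = ("/opt/tritonserver/bin",)


def find_binary(name: str = "tritonserver",
                extra_dirs: Iterable[str] = ASSUMED_DIRS) -> Optional[str]:
    """Return the path of an executable called `name`, or None if it is absent."""
    # Check the directories on PATH first.
    found = shutil.which(name)
    if found:
        return found
    # Fall back to the assumed install directories.
    for directory in extra_dirs:
        candidate = os.path.join(directory, name)
        if os.path.isfile(candidate) and os.access(candidate, os.X_OK):
            return candidate
    return None


if __name__ == "__main__":
    path = find_binary()
    print(path if path else "tritonserver binary not found")
```

Running the script in both the 24.05 and 24.06 images should show whether the binary is really gone or has only moved off PATH.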