-
I have configured an ensemble model in Triton Inference Server, which includes DALI preprocessing and TensorRT inference. When I uploaded a GIF image from the client, the Triton server crashed with th…
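For context, a hedged client-side sketch of how encoded image bytes typically reach a DALI ensemble, with a guard that rejects GIF before the server ever sees it. The model name `ensemble_model` and input name `INPUT` are placeholders, not taken from the post:

```
import numpy as np
import tritonclient.http as httpclient

with open("image.gif", "rb") as f:
    raw = f.read()

# DALI's image decoder may not handle GIF; rejecting unsupported formats on
# the client avoids handing the server bytes it cannot decode.
if raw[:6] in (b"GIF87a", b"GIF89a"):
    raise ValueError("GIF is not supported by this pipeline; send JPEG/PNG instead")

data = np.frombuffer(raw, dtype=np.uint8)
client = httpclient.InferenceServerClient(url="localhost:8000")
inp = httpclient.InferInput("INPUT", [int(data.size)], "UINT8")
inp.set_data_from_numpy(data)
result = client.infer(model_name="ensemble_model", inputs=[inp])
```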
-
Hello,
I am seeking advice on the best practices for tracking all inputs and predictions made by a model when using Triton Inference Server. Specifically, I would like to track every interaction th…
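One client-side option is a thin wrapper that appends every request/response pair to a JSONL audit log; a minimal sketch, assuming an HTTP client and placeholder names (`my_model`, `INPUT`, `OUTPUT`). Triton's built-in tracing and metrics are worth evaluating first for server-side capture at scale:

```
import json
import time
import uuid

import numpy as np
import tritonclient.http as httpclient

client = httpclient.InferenceServerClient(url="localhost:8000")

def tracked_infer(model_name, input_array, log_path="predictions.jsonl"):
    # Placeholder tensor names; adjust to the model's config.pbtxt.
    inp = httpclient.InferInput("INPUT", list(input_array.shape), "FP32")
    inp.set_data_from_numpy(input_array.astype(np.float32))
    result = client.infer(model_name=model_name, inputs=[inp])

    # One JSON record per interaction: id, timestamp, input, and prediction.
    record = {
        "id": str(uuid.uuid4()),
        "ts": time.time(),
        "model": model_name,
        "input": input_array.tolist(),
        "output": result.as_numpy("OUTPUT").tolist(),
    }
    with open(log_path, "a") as f:
        f.write(json.dumps(record) + "\n")
    return result
```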
-
Hi,
I am trying to use MMpose with the NVIDIA Triton server, but it does not support plain PyTorch models; it supports TorchScript, ONNX, and a few other formats. So I have converted the MMpose MobileNetV2 model to…
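For reference, a generic `torch.onnx.export` sketch. A torchvision MobileNetV2 stands in for the actual MMpose model here, and the shapes, tensor names, and opset are assumptions; MMDeploy also ships dedicated export tooling for MMpose:

```
import torch
import torchvision

# Stand-in model; with MMpose you would load the trained pose model instead.
model = torchvision.models.mobilenet_v2().eval()
dummy = torch.randn(1, 3, 224, 224)

torch.onnx.export(
    model, dummy, "mobilenetv2.onnx",
    input_names=["input"], output_names=["output"],
    opset_version=13,
    # Allow variable batch size so Triton's dynamic batching can be used.
    dynamic_axes={"input": {0: "batch"}, "output": {0: "batch"}},
)
```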
-
Hello, I'm curious whether we can already use sglang as a backend for NVIDIA's Triton Server.
Amazing work with the library, by the way; love it!
-
### System Info
2x NVIDIA L20 GPUs
Launched Triton server with the TensorRT-LLM backend v0.12.0 in a container.
### Who can help?
_No response_
### Information
- [ ] The official example scripts
-…
-
Hello.
I am writing to inquire about the PyTorch version used in the Triton Inference Server 24.01 release.
Upon reviewing the documentation, I noticed that Triton 24.01 includes PyTorch version…
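The release notes are authoritative, but the bundled build can also be confirmed directly inside the 24.01 container:

```
import torch

print(torch.__version__)    # PyTorch build shipped in the container
print(torch.version.cuda)   # CUDA version that build targets
```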
-
When will NAV support creating a Triton repo for this new backend? Is it on your roadmap?
https://github.com/triton-inference-server/tensorrtllm_backend
-
Does Triton Inference Server support multi-node deployment?
I built llama-7b for tensorrtllm_backend and launched Triton Inference Server.
I have 4 GPUs, but Triton Inference Server loads only 1 GPU.
imag…
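For what it's worth, a TensorRT-LLM engine occupies only as many GPUs as the tensor-parallel size it was built with, so an engine built with `tp_size=1` will load a single GPU no matter how many are present. A hedged sketch of a 4-GPU launch via the tensorrtllm_backend helper script; the script path and flag names are assumptions to verify against your checkout:

```
import subprocess

# The tensorrtllm_backend repo ships a launch helper that starts one MPI rank
# of tritonserver per GPU. world_size must match the engine's tp_size.
subprocess.run(
    [
        "python3", "scripts/launch_triton_server.py",
        "--world_size", "4",
        "--model_repo", "/path/to/triton_model_repo",
    ],
    check=True,
)
```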
-
**Description**
We have an ensemble of 2 models chained together (descriptions of the models below).
Calling only the "preprocessing" model yields a max throughput of 21,500 QPS @ 6 CPU cores of usage.
Cal…
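To make numbers like this comparable between the single model and the full ensemble, a quick async probe can help, though `perf_analyzer` (shipped with the Triton SDK) is the more rigorous tool. Model name, input name, and shape below are placeholders:

```
import asyncio
import time

import numpy as np
from tritonclient.grpc import InferInput
from tritonclient.grpc.aio import InferenceServerClient

async def bench(model_name: str, concurrency: int = 64, total: int = 10_000):
    client = InferenceServerClient(url="localhost:8001")
    # Placeholder tensor name and shape; substitute your model's.
    data = np.zeros((1, 3, 224, 224), dtype=np.float32)
    inp = InferInput("INPUT", list(data.shape), "FP32")
    inp.set_data_from_numpy(data)
    sem = asyncio.Semaphore(concurrency)

    async def one():
        # Bound in-flight requests so the client mimics fixed concurrency.
        async with sem:
            await client.infer(model_name=model_name, inputs=[inp])

    start = time.perf_counter()
    await asyncio.gather(*(one() for _ in range(total)))
    print(f"{total / (time.perf_counter() - start):.0f} QPS")
    await client.close()

asyncio.run(bench("preprocessing"))
```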
-
I tested `tritonclient:2.43.0` on Ubuntu 22.04 with `grpcio:1.62.1` and ran into a memory leak. Example for reproduction:
```
import asyncio
from tritonclient.grpc.aio import Inferen…
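# ...the snippet above is truncated in this feed. What follows is a hedged
# completion of a minimal reproduction, not the reporter's exact code; the
# URL and iteration count are assumptions.
from tritonclient.grpc.aio import InferenceServerClient

async def main():
    # Repeatedly create, use, and close clients; watching the process RSS
    # across iterations is one way to confirm whether memory keeps growing.
    for _ in range(10_000):
        client = InferenceServerClient(url="localhost:8001")
        await client.is_server_live()
        await client.close()

asyncio.run(main())
```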