-
Hi,
I am curious how using multiple MIG instances differs from using multiple non-MIG GPUs (such as V100s) in terms of parallelism, memory sharing, etc. I didn't receive the same outputs in…
-
**Description**
The Triton Server build with the PyTorch backend does not work for CPU_ONLY. It expects libraries like libcudart.so even though the build was for CPU. Below is how we invoke the build. Fro…
-
**Is your feature request related to a problem? Please describe.**
Rust API for Triton Server, to integrate Triton in-process with a Rust server.
Rust is now a universally recommended language to deve…
-
Unable to run performance analyzer on my model
I am using a SageMaker wrapper image of Triton Server and am able to serve the model, send requests, and verify that it is up; all ports for gRPC, …
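(For reference, a typical `perf_analyzer` invocation against a gRPC endpoint looks like the sketch below; the model name and address are placeholders, and a SageMaker wrapper image may remap Triton's default ports.)
```
# Hypothetical model name and endpoint; SageMaker images may expose
# different ports than Triton's defaults (8000 HTTP / 8001 gRPC / 8002 metrics).
perf_analyzer -m my_model -u localhost:8001 -i grpc --concurrency-range 1:4
```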
-
I tested `tritonclient:2.43.0` on Ubuntu 22.04 with `grpcio:1.62.1` and encountered a memory leak. Example for reproduction:
```
import asyncio
from tritonclient.grpc.aio import InferenceServerClient
…
```
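The original reproduction is truncated above; the following is a minimal sketch of the kind of async inference loop that exercises the same client path. The endpoint `localhost:8001`, model name `mymodel`, input name `INPUT0`, and tensor shape are assumptions, not taken from the report.
```
import asyncio

import numpy as np
from tritonclient.grpc import InferInput
from tritonclient.grpc.aio import InferenceServerClient


async def main():
    # Hypothetical endpoint and model; substitute your own deployment.
    client = InferenceServerClient(url="localhost:8001")
    data = np.zeros((1, 3), dtype=np.float32)
    for _ in range(100_000):
        # Build a fresh request each iteration; per-call allocations are
        # where a client-side leak would show up over many iterations.
        inp = InferInput("INPUT0", list(data.shape), "FP32")
        inp.set_data_from_numpy(data)
        await client.infer(model_name="mymodel", inputs=[inp])
    await client.close()


asyncio.run(main())
```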
-
### Checklist
- [X] 1. I have searched related issues but cannot get the expected help.
- [X] 2. The bug has not been fixed in the latest version.
- [X] 3. Please note that if the bug-related issue y…
-
Hi,
I'm thinking about using the MMDeploy SDK as a backend in the [Triton server](https://github.com/triton-inference-server). It seems that many people would be interested in this usage. Do you h…
-
**Description**
![output_image](https://github.com/user-attachments/assets/bed4e808-a3e0-4225-96c4-04ae69c65a15)
**Triton Information**
…
-
**Description**
I'm running Triton Inference Server with the vLLM backend as a container on Kubernetes.
I followed the [Triton metrics documentatio…
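(Since the report is truncated: a minimal sketch of polling the metrics endpoint is shown below. Port 8002 is Triton's default metrics port; the in-cluster service name `triton-svc` and the metric prefixes are assumptions to adjust for your deployment.)
```
import requests

# Triton serves Prometheus-format metrics on port 8002 by default.
# "triton-svc" is a hypothetical in-cluster service name; substitute yours.
resp = requests.get("http://triton-svc:8002/metrics", timeout=5)
resp.raise_for_status()
for line in resp.text.splitlines():
    # Core server metrics are prefixed "nv_"; the vLLM backend's
    # metrics are prefixed "vllm:".
    if line.startswith(("nv_", "vllm:")):
        print(line)
```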
-
**Description**
I want to use the model's queue policy (max queue length and timeout), but I found that Triton does not handle requests accurately either, and I found this issue https://github.com/triton-i…
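(For context, the queue policy in question is configured per model in `config.pbtxt` under `dynamic_batching`; a minimal sketch with illustrative values:)
```
dynamic_batching {
  default_queue_policy {
    # Reject rather than delay requests that wait too long in the queue.
    timeout_action: REJECT
    default_timeout_microseconds: 100000
    # Cap the queue length; requests beyond this are rejected.
    max_queue_size: 10
    # Let individual requests override the default timeout.
    allow_timeout_override: true
  }
}
```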