-
Hi @xhp-hust-2018-2011 ,
Thanks for the great work done on this repo. I'm trying to use your prebuilt PyTorch model with [NVIDIA's Triton Inference Server](https://docs.nvidia.com/deeplearning/sdk/…
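For context, serving a TorchScript export with Triton's PyTorch (libtorch) backend comes down to placing a `model.pt` under a versioned directory and writing a `config.pbtxt`. A minimal sketch, assuming the model has already been traced or scripted; the model name, shapes, and batch size below are placeholders, not values from this repo:

```
# config.pbtxt for a hypothetical TorchScript model
# repository layout: model_repository/my_model/{config.pbtxt, 1/model.pt}
name: "my_model"
platform: "pytorch_libtorch"
max_batch_size: 8
input [
  {
    name: "INPUT__0"        # libtorch backend uses INPUT__<index> naming
    data_type: TYPE_FP32
    dims: [ 3, 224, 224 ]   # placeholder shape
  }
]
output [
  {
    name: "OUTPUT__0"
    data_type: TYPE_FP32
    dims: [ 1000 ]          # placeholder shape
  }
]
```

Triton is then pointed at the repository with `tritonserver --model-repository=/path/to/model_repository`.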
-
**Description**
A clear and concise description of what the bug is.
r23.04
```
I0718 11:39:24.385839 1 server.cc:653]
| Model | Version | Status …
```
-
### System Info
CPU - x86_64, Intel(R) Xeon(R) CPU @ 2.20GHz
CPU memory - 1.3TB
GPUs - Nvidia A100 80GB
git commit ID of the TensorRT-LLM backend: e432c6a0cc85f9790365067e7e3175e1b2ce3559
TRT-LLM …
-
**Describe the bug**
When using Triton with Velocity, Triton cannot connect to the MySQL server, even though the connection worked normally before.
**To Reproduce**
1. Use Triton with Velocity
2. Configure Triton to…
-
**Is your feature request related to a problem? Please describe.**
1. We would like to try parallel model execution on iGPU+DLA devices. Is it possible to run triton-inference-server on a V3NP or Ori…
-
**Is your feature request related to a problem? Please describe.**
When a model repository is downloaded from a remote location, it may contain references to files that need to be expl…
-
**Description**
In a k8s cluster with multiple GPUs, I run a single Triton server pod serving multiple models, including BLS-based models.
Sometimes under heavy load, Triton restarts with Sig…
-
**Description**
The `nv_inference_pending_request_count` metric exported by tritonserver is incorrect in ensemble_stream mode.
The ensemble_stream pipeline contains 3 steps: preprocess, fastertra…
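When debugging a discrepancy like this, it helps to scrape Triton's Prometheus metrics endpoint (port 8002 by default) and compare the per-model gauges directly. A minimal sketch of parsing the exposition text; the sample values below are made up for illustration:

```python
import re

def parse_metric(text, name):
    """Extract {label-set: value} pairs for one Prometheus metric
    from exposition-format text (comment lines are skipped)."""
    out = {}
    # Match lines like: name{labels} value  (labels optional)
    pattern = re.compile(
        rf'^{re.escape(name)}(\{{[^}}]*\}})?\s+([0-9.eE+-]+)\s*$', re.M)
    for m in pattern.finditer(text):
        out[m.group(1) or ""] = float(m.group(2))
    return out

# Illustrative sample of what `curl localhost:8002/metrics` might return
sample = '''# HELP nv_inference_pending_request_count Instantaneous pending request count
# TYPE nv_inference_pending_request_count gauge
nv_inference_pending_request_count{model="preprocess",version="1"} 2
nv_inference_pending_request_count{model="ensemble_stream",version="1"} 5
'''

print(parse_metric(sample, "nv_inference_pending_request_count"))
```

Comparing the gauge on the ensemble model against the sum over its composing models makes it easy to see where the count diverges.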
-
### System Info
tensorrt-llm version 0.11.0.dev2024062500
Architecture: x86_64
AMD EPYC 9354 32-Core Processor
```txt
+----------------------------------------------------------…
```
-
From the requirements doc:
**OOTB support for NVidia Triton Inference Server**
- We are going with OpenVINO for now, as Triton currently cannot be built due to maintenance concerns.
Acceptance criteria:
- Scope…