-
On server/deploy/oci, running `helm install example .` to deploy the Inference Server fails: the pod never reaches Running because the liveness and readiness probes fail.
Below are the log detai…
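For context, Triton exposes standard KServe health endpoints that Kubernetes probes typically target. A minimal sketch for checking them directly (the host and default HTTP port 8000 are assumptions for a default deployment):

```python
import requests

# Triton's standard health endpoints (KServe/v2 protocol); the Helm chart's
# liveness/readiness probes typically point at these paths. Host/port are
# assumptions for a default deployment.
BASE = "http://localhost:8000"

live = requests.get(f"{BASE}/v2/health/live", timeout=5)
ready = requests.get(f"{BASE}/v2/health/ready", timeout=5)

# 200 means the probe would pass; anything else (or a timeout) is what
# Kubernetes reports as "Liveness probe failed" / "Readiness probe failed".
print("live:", live.status_code, "ready:", ready.status_code)
```

If these return non-200 while the server process is up, the model repository is usually still loading or failed to load, which is the common cause of readiness-probe failures at startup.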
-
## Description
I converted nafnet from ONNX to TensorRT on a Tesla T4 with TensorRT 10.0. However, the inference speed is much slower than that of the engine converted with TensorRT 8.6.
TensorRT 10.0:
[05/24…
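For reference, a minimal sketch of the ONNX-to-TensorRT conversion being compared (the model path and the FP16 flag are assumptions; the same script runs under both 8.6 and 10.0, so benchmarking the two resulting engines isolates the version difference):

```python
import tensorrt as trt

logger = trt.Logger(trt.Logger.WARNING)
builder = trt.Builder(logger)
# Explicit-batch network; the flag is deprecated in TensorRT 10 but still
# accepted, so this works under both versions being compared.
network = builder.create_network(
    1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH))
parser = trt.OnnxParser(network, logger)

with open("nafnet.onnx", "rb") as f:       # assumed model path
    if not parser.parse(f.read()):
        for i in range(parser.num_errors):
            print(parser.get_error(i))
        raise SystemExit("ONNX parse failed")

config = builder.create_builder_config()
config.set_flag(trt.BuilderFlag.FP16)      # assumption: FP16 build on the T4

engine_bytes = builder.build_serialized_network(network, config)
if engine_bytes is None:
    raise SystemExit("engine build failed")
with open("nafnet.plan", "wb") as f:
    f.write(engine_bytes)
```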
-
**Problem: GKE image streaming will not work with these images due to repeated layers**
I would like to use GKE image streaming with triton-inference-server images.
This feature will only work if…
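A quick way to check the repeated-layer claim against a specific tag is to count duplicate layer digests in the image metadata. A sketch, assuming a local Docker daemon with the image already pulled (the tag is a placeholder):

```python
import json
import subprocess
from collections import Counter

IMAGE = "nvcr.io/nvidia/tritonserver:24.04-py3"  # assumed tag

# RootFS.Layers lists the image's layer digests in order; a digest that
# appears more than once is the kind of repeated layer this report says
# breaks GKE image streaming.
meta = json.loads(subprocess.check_output(["docker", "inspect", IMAGE]))
layers = meta[0]["RootFS"]["Layers"]

for digest, count in Counter(layers).items():
    if count > 1:
        print(f"{digest} appears {count} times")
```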
-
@npuichigo I am trying to use [Triton Inference Server with TensorRT-LLM backend](https://nvidia.github.io/TensorRT-LLM/quick-start-guide.html#deploy-with-triton-inference-server) with [openweb-ui](ht…
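For context, the linked TensorRT-LLM quick start serves the model behind Triton's HTTP generate endpoint. A minimal sketch of a request against it (the model name `ensemble` and the payload fields follow the quick-start guide; the host and port are assumptions):

```python
import requests

# Triton's HTTP generate extension; "ensemble" is the model name used in
# the TensorRT-LLM quick start, localhost:8000 is an assumption.
url = "http://localhost:8000/v2/models/ensemble/generate"
payload = {
    "text_input": "What is machine learning?",
    "max_tokens": 64,
    "bad_words": "",
    "stop_words": "",
}

resp = requests.post(url, json=payload, timeout=60)
resp.raise_for_status()
print(resp.json()["text_output"])
```

Note that this endpoint is not OpenAI-compatible, while open-webui expects an OpenAI-style API, so connecting the two typically requires an adapter in between.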
-
/usr/bin/ld: ../libtritonserver.so: undefined reference to `absl::lts_20220623::StartsWithIgnoreCase(absl::lts_20220623::string_view, absl::lts_20220623::string_view)'
/usr/bin/ld: ../libtritonserv…
-
Tracking the second round of issues submitted to [triton-inference-server](https://github.com/triton-inference-server/server):
- [ ] https://github.com/triton-inference-server/server/issues/2018: Con…
-
I used the image nvcr.io/nvidia/tritonserver:23.09-py3-min to compile and install Triton. The com…
-
**Description**
I was using Triton Server nvcr.io/nvidia/tritonserver:24.04-py3 on my local Windows 10 machine via a Docker container. I installed the latest NVIDIA driver 555.85, and the Docker containe…
-
**Description**
Before calling unloadmodel, memory usage is as below:
After calling unloadmodel, memory usage is as below:
**Triton Information**
What vers…
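To make the report reproducible, a minimal sketch of the load/unload cycle being measured, using the `tritonclient` HTTP API (assumes the server runs with `--model-control-mode=explicit`; the model name is a placeholder):

```python
import tritonclient.http as httpclient

# Requires tritonserver started with --model-control-mode=explicit;
# "my_model" is a placeholder model name.
client = httpclient.InferenceServerClient(url="localhost:8000")

client.load_model("my_model")
# ... take a memory snapshot here (e.g. nvidia-smi / container RSS) ...
client.unload_model("my_model")
# ... take a second snapshot: the report says usage does not drop back ...
```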
-
**Routine checks**
[//]: # (Delete the space inside the brackets and fill in an x)
+ [ ] I have confirmed that no similar issue already exists
+ [ ] I have confirmed that I have upgraded to the latest version
+ [ ] I have read the project README in full and confirmed that the current version cannot meet my needs
+ [ ] I understand and am willing to follow up on this issue, help with testing, and provide feedback
+ [ ] I understand and accept the above, and I understand that the maintainers have limited time; **issues that do not follow the rules may be…