-
### System Info
```
[libprotobuf ERROR /tmp/tritonbuild/tritonserver/build/_deps/repo-third-party-build/grpc-repo/src/grpc/third_party/protobuf/src/google/protobuf/text_format.cc:335] Error parsing text-…
```
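This error is emitted while Triton parses a text-format protobuf, almost always a model's `config.pbtxt`. For comparison, a minimal sketch of a well-formed config (the model name, backend, and shapes are illustrative assumptions, not taken from the report):

```
# Hypothetical minimal config.pbtxt. A text_format parse error like the
# one above usually points to a typo, a stray character, or an unknown
# field in the config file being parsed.
name: "my_model"          # assumed model name
backend: "onnxruntime"    # assumed backend
max_batch_size: 8
input [
  {
    name: "INPUT0"
    data_type: TYPE_FP32
    dims: [ 16 ]
  }
]
output [
  {
    name: "OUTPUT0"
    data_type: TYPE_FP32
    dims: [ 16 ]
  }
]
```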
-
Hello, thanks for the work being done here.
**Description**
I'm trying to debug multiple issues that happen in production, and upgrading our Triton Server to 24.05 is one of the solutions I'm …
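For reference, a sketch of pulling and launching the 24.05 release, assuming the standard NGC image tag and a placeholder model repository path:

```sh
docker pull nvcr.io/nvidia/tritonserver:24.05-py3
docker run --gpus all --rm -p 8000:8000 -p 8001:8001 -p 8002:8002 \
  -v /path/to/model_repository:/models \
  nvcr.io/nvidia/tritonserver:24.05-py3 \
  tritonserver --model-repository=/models
```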
-
According to the instructions, uncommenting should enable dynamic batching, yet even after uncommenting, it has not taken effect.
![WeCom screenshot_17029850515536](https://github.com/triton-inference-server/te…
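For reference, a minimal sketch of what the uncommented block typically looks like in `config.pbtxt` (the batch sizes and delay are illustrative assumptions):

```
# Uncommenting is only half the step: the file must also parse cleanly
# and the model must be (re)loaded for the setting to apply.
dynamic_batching {
  preferred_batch_size: [ 4, 8 ]
  max_queue_delay_microseconds: 100
}
```

Whether it took effect can be verified by fetching the served config, e.g. `curl localhost:8000/v2/models/<model_name>/config`, and checking for the `dynamic_batching` field.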
-
**Description**
Encountered a critical issue with Triton Inference Server in poll mode: the server becomes unresponsive when it loads a Python model that contains errors. Specifically, if a Python model h…
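A hypothetical minimal `model.py` that reproduces the "Python model with errors" case described above; the failure is simulated, while the `TritonPythonModel` interface itself is the Python backend's standard one:

```python
import triton_python_backend_utils as pb_utils

class TritonPythonModel:
    def initialize(self, args):
        # Deliberate failure, standing in for a real bug (bad import,
        # missing file, etc.) in a production model. In poll mode this
        # is the point where the unresponsiveness is reported to occur.
        raise pb_utils.TritonModelException("simulated initialization error")

    def execute(self, requests):
        # Never reached when initialize() fails.
        return []
```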
-
## Description
The model is not reloaded when the underlying backend runtime (pytorch_backend and libtorch in this case) raises errors.
In such cases, it would be useful in a production en…
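Triton does not do this automatically; a sketch of an external watchdog that approximates the behavior, assuming the server runs with `--model-control-mode=explicit` (the model name and polling interval are placeholders):

```python
import time
import tritonclient.http as httpclient

client = httpclient.InferenceServerClient(url="localhost:8000")
MODEL = "my_torch_model"  # assumed model name

while True:
    # If the model dropped out of READY (e.g. after a libtorch error),
    # ask the server to attempt a (re)load.
    if not client.is_model_ready(MODEL):
        try:
            client.load_model(MODEL)
        except Exception as e:
            print(f"reload of {MODEL} failed: {e}")
    time.sleep(30)
```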
-
### System Info
- Hardware: 8x NVIDIA H100 80GB HBM3
- Software: NVIDIA driver 535.129.03, CUDA 12.4
- tensorrtllm_backend commit: [d173386f4dd7b3ed5…
-
**Is your feature request related to a problem? Please describe.**
Triton has a fallback mechanism for writing intermediates to pinned CPU memory when the CUDA memory pool is full.
https://github.c…
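For context, both pools involved in that fallback are sized at server startup; a sketch with placeholder byte sizes:

```sh
# --cuda-memory-pool-byte-size takes <device id>:<bytes>;
# --pinned-memory-pool-byte-size takes <bytes>. 256 MiB is an
# arbitrary example value, not a recommendation.
tritonserver --model-repository=/models \
  --cuda-memory-pool-byte-size=0:268435456 \
  --pinned-memory-pool-byte-size=268435456
```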
-
**Is your feature request related to a problem? Please describe.**
I don't see any possibility to install [python_backend](https://github.com/triton-inference-server/python_backend#python-backend…
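A sketch of building python_backend from source, following the usual out-of-tree Triton backend build pattern from the repo's README; `<GIT_BRANCH_NAME>` is a placeholder (e.g. `r24.05`) that should match the server version:

```sh
git clone https://github.com/triton-inference-server/python_backend
cd python_backend
mkdir build && cd build
cmake -DTRITON_ENABLE_GPU=ON \
      -DTRITON_BACKEND_REPO_TAG=<GIT_BRANCH_NAME> \
      -DTRITON_COMMON_REPO_TAG=<GIT_BRANCH_NAME> \
      -DTRITON_CORE_REPO_TAG=<GIT_BRANCH_NAME> \
      -DCMAKE_INSTALL_PREFIX:PATH=$(pwd)/install ..
make install
```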
-
I installed tensorrtllm_backend in the following way:
1. `docker pull nvcr.io/nvidia/tritonserver:23.12-trtllm-python-py3`
2. `docker run -v /data2/share/:/data/ -v /mnt/sdb/benchmark/xiangrui:/root…
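The run command above is truncated; the general shape of such a launch is sketched below, with placeholder mounts rather than the original ones:

```sh
# Start an interactive shell in the TRT-LLM variant of the Triton
# container; host paths are illustrative placeholders.
docker run --gpus all -it --rm \
  -v /host/models:/models \
  nvcr.io/nvidia/tritonserver:23.12-trtllm-python-py3 bash
```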
-
**Description**
After compiling the Triton server against libraries installed via vcpkg, these compiled libraries cause symbol conflicts that result in linking failures when compiling the client…
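One way to confirm which symbols collide is to compare what the vcpkg-built library and the system copy both define; the library names and paths below are illustrative, not taken from the report:

```sh
# List the dynamic symbols each copy defines, then print the overlap:
# any symbol appearing in both is a candidate for the link conflict.
nm -D --defined-only /opt/vcpkg/installed/x64-linux/lib/libfoo.so | sort > vcpkg.syms
nm -D --defined-only /usr/lib/libfoo.so | sort > system.syms
comm -12 vcpkg.syms system.syms
```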