-
Trying under Windows here (adding to CogVideoX as per your demo script).
```
File "D:\CogVideoX\CogVideo\venv\lib\site-packages\triton\runtime\build.py", line 52, in _build
raise RuntimeErr…
```
-
### Search before asking
- [X] I have searched the YOLOv8 [issues](https://github.com/ultralytics/ultralytics/issues) and [discussions](https://github.com/ultralytics/ultralytics/discussions) and fou…
-
**Description**
After compiling the Triton server against libraries installed via vcpkg, those compiled libraries cause symbol conflicts that make the client build fail to link…
-
I installed tensorrtllm_backend in the following way:
1. `docker pull nvcr.io/nvidia/tritonserver:23.12-trtllm-python-py3`
2. `docker run -v /data2/share/:/data/ -v /mnt/sdb/benchmark/xiangrui:/root…
-
### System Info
I am working on the benchmarking suite in vLLM team, and now trying to run TensorRT-LLM for comparison. I am relying on this github repo (https://github.com/neuralmagic/tensorrt-demo)…
-
### System Info
- arch: x86-64
- GPU: RTX 3070
- Docker image: nvcr.io/nvidia/tritonserver:24.01-trtllm-python-py3
- TensorRT-LLM backend tag: 0.7.2
- TensorRT-LLM tag: 0.7.1 (80bc07510ac4ddf13c0d76ad2…
-
Scenario:
* I am hosting PaddleOCR in Triton server via the Python backend.
* I packed PaddleOCR and all of its dependencies into a tar.gz file following this instruction:
https://github.com/tri…
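
For context, a packed environment like this is normally wired up through the Python backend's `EXECUTION_ENV_PATH` parameter in the model's `config.pbtxt`. A minimal sketch (the model name and archive filename here are illustrative, not taken from the issue):
```
name: "paddleocr"
backend: "python"
parameters: {
  key: "EXECUTION_ENV_PATH"
  value: { string_value: "$$TRITON_MODEL_DIRECTORY/paddleocr_env.tar.gz" }
}
```
`$$TRITON_MODEL_DIRECTORY` resolves to the model's own directory inside the repository, so the archive can ship alongside the model files.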
-
@Tabrizian In [this](https://github.com/triton-inference-server/client/blob/main/src/python/examples/simple_grpc_infer_client.py) example (line 131) they pass a python dict in the `headers` arg of `t…
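
As background for the question: HTTP headers are naturally a dict, while gRPC metadata is a sequence of `(key, value)` pairs whose keys must be lowercase ASCII, so a client library that accepts a dict has to translate it. A minimal sketch of that translation (this is a hypothetical helper for illustration, not the tritonclient implementation):

```python
def dict_to_grpc_metadata(headers):
    """Convert an HTTP-style header dict into gRPC metadata.

    gRPC metadata is a sequence of (key, value) tuples, and keys
    must be lowercase ASCII. Hypothetical helper, for illustration only.
    """
    return tuple((key.lower(), str(value)) for key, value in headers.items())


# Example: an HTTP-style dict becomes lowercase-keyed metadata tuples.
metadata = dict_to_grpc_metadata({"Authorization": "Bearer abc123"})
```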
-
**Description**
I have a 5 steps ensemble pipeline for triton.
* 3 steps are torchscript artifacts
* 2 steps are tensorrt compiled models
In the pbtxt files I have:
```
instance_group [{ kind: KIN…
```
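
For reference, a complete `instance_group` stanza in a Triton `config.pbtxt` typically looks like the sketch below (the count and GPU index are illustrative values, not taken from the issue):
```
instance_group [
  {
    kind: KIND_GPU
    count: 1
    gpus: [ 0 ]
  }
]
```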
-
![image](https://github.com/user-attachments/assets/b2fbbab3-1cc8-4160-b446-b7e09b8089e7)
any suggestions?
CPU: 11th Gen Intel(R) Core(TM) i7-11800H @ 2.30GHz (8 cores / 16 threads)
I…