-
Hello, I want to deploy a quantized Llama-3-8B model using Triton Server. I followed the steps below:
1. Create a container from the nvcr.io/nvidia/tritonserver:24.06-trtllm-python-py3 base image.
3.…
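Step 1 above can be sketched as a `docker run` invocation; the container name, mounted model path, and port mappings here are assumptions for illustration, not taken from the original report:

```shell
# Launch the TRT-LLM Triton container (image name from step 1).
# The container name and the /models mount are illustrative.
docker run --rm -it --gpus all \
  --name triton-llama3 \
  -p 8000:8000 -p 8001:8001 -p 8002:8002 \
  -v "$(pwd)/models:/models" \
  nvcr.io/nvidia/tritonserver:24.06-trtllm-python-py3
```

Ports 8000/8001/8002 are Triton's default HTTP, gRPC, and metrics endpoints.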
-
#### Description
I am currently working on deploying the Seamless M4T model for text-to-text translation on a Triton server. I have successfully exported the `text.encoder` to ONNX and traced it …
-
**Description**
I have specified [-1, 1024] as the output dimensions for my ensemble model, but the output is still reshaped to [1024].
**Triton Information**
NVIDIA Release 24.03 (build 86102629…
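For context, an ensemble output is declared in `config.pbtxt` roughly as below (the tensor name is illustrative). One common source of confusion: when `max_batch_size > 0`, Triton treats the batch dimension as implicit, so `dims` describe a single request's shape without it:

```
output [
  {
    name: "OUTPUT0"        # illustrative tensor name
    data_type: TYPE_FP32
    dims: [ -1, 1024 ]     # batch dim is implicit when max_batch_size > 0
  }
]
```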
-
I know that Triton 2.18+ supports PyTorch, and we want to use that.
Our Jetson Nano runs JetPack 4.6.4, the latest version. Can this version install Triton 2.20?
We need support for the PyTorch and Python backends.
-
I used the nvcr.io/nvidia/tritonserver:23.09-py3-min image to compile and install Triton. The com…
-
Could you please share the Dockerfile for
registry.baidubce.com/paddlepaddle/fastdeploy:llm-base-gcc12.3-cuda11.8-cudnn8-nccl2.15.5?
-
Were you able to run mxnet models with Triton Inference Server?
-
### Search before asking
- [X] I have searched the Ultralytics YOLO [issues](https://github.com/ultralytics/ultralytics/issues) and found no similar bug report.
### Ultralytics YOLO Component
_No …
-
### System Info
- CPU: Intel 14700K
- GPU: RTX 4090
- TensorRT-LLM: 0.13
- Docker image: tritonserver:24.09-trtllm-python-py3
### Who can help?
@Tracin
### Information
- [X] The official example scri…
-
### Describe the bug
A substantial fraction of training time in the Conformer model is spent in the convolution module. Within it, much of the cost comes from the depthwise convolution, which sets `groups` to…
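For context, a depthwise convolution sets `groups` equal to the number of channels, so each channel is filtered independently by its own kernel. A minimal NumPy sketch of the operation (an illustration of the technique, not the model's actual implementation):

```python
import numpy as np

def depthwise_conv1d(x, w):
    """Depthwise 1-D correlation: groups == channels.

    x: (channels, time) input signal
    w: (channels, kernel) one filter per channel
    Returns (channels, time - kernel + 1).
    """
    C, T = x.shape
    _, K = w.shape
    out = np.empty((C, T - K + 1))
    for c in range(C):
        # np.convolve flips the kernel, so reverse w[c] to get correlation.
        out[c] = np.convolve(x[c], w[c][::-1], mode="valid")
    return out

# Two channels, each convolved only with its own filter.
x = np.arange(6, dtype=float).reshape(2, 3)   # [[0,1,2],[3,4,5]]
w = np.ones((2, 2))
y = depthwise_conv1d(x, w)                    # [[1,3],[7,9]]
```

Because no channel mixing occurs, the cost is C·K multiply-adds per output step instead of C²·K for a dense convolution, which is exactly why Conformer uses it, and also why its per-FLOP efficiency on GPUs can be poor.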