-
### Your current environment
```text
The output of `python collect_env.py`
Collecting environment information...
PyTorch version: 2.3.0+cu121
Is debug build: False
CUDA used to build PyTorch…
```
-
In the demo, the VAE model is accelerated with:
```shell
# Accelerating VAE with TensorRT
trtexec --onnx=vae.onnx --saveEngine=vae.plan --minShapes=latent_sample:1x4x64x64 --optShapes=latent_sample:4x4x64x64 --m…
```
-
### Description
```shell
Following the tutorial, running the command `CUDA_VISIBLE_DEVICES=1,2 mpirun -n 1 --allow-run-as-root /opt/tritonserver/bin/tritonserver --model-repository=${WORKSPACE}/all_models/bert/` produces an error:
E0412 06:53:22.3687…
```
-
**Description**
The model repo is an object detection ensemble, which consists of a preprocessor written with the Python backend, and the main model in TensorRT plan. The Python backend uses CuPy t…
-
I'd like to add the triton machine driver to Rancher (rancher.com).
Docs: http://rancher.com/docs/rancher/v1.3/en/configuration/machine-drivers/
Where can I find the "machine driver binary 64-bit…
-
**Description**
Istio is deprecated on GKE. You can no longer create a cluster with the Istio add-on, so the one-click deployer cannot be used.
**Triton Information**
Not applicable
GKE version 1.…
-
Return the leaf index of each tree, just like the original LightGBM C API parameter `predict_type=C_API_PREDICT_LEAF_INDEX`.
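For readers unfamiliar with leaf-index prediction: instead of returning the summed leaf values, the model returns, for each tree in the ensemble, the index of the leaf the sample falls into. A minimal sketch of that idea using hand-built toy trees (this is an illustration of the semantics, not LightGBM's implementation):

```python
# Toy illustration of per-tree leaf-index prediction (not LightGBM itself):
# each tree is a nested dict; predicting returns, for each tree, the index
# of the leaf the sample lands in rather than the leaf's value.

def leaf_index(tree, x):
    """Walk a tiny tree dict and return the index of the leaf x lands in."""
    node = tree
    while "leaf" not in node:
        node = node["left"] if x[node["feature"]] <= node["threshold"] else node["right"]
    return node["leaf"]

def predict_leaf_indices(ensemble, x):
    """Analogue of predict_type=C_API_PREDICT_LEAF_INDEX: one index per tree."""
    return [leaf_index(tree, x) for tree in ensemble]

# Two hand-built stumps; leaves are numbered 0 and 1 within each tree.
ensemble = [
    {"feature": 0, "threshold": 0.5, "left": {"leaf": 0}, "right": {"leaf": 1}},
    {"feature": 1, "threshold": 2.0, "left": {"leaf": 0}, "right": {"leaf": 1}},
]

print(predict_leaf_indices(ensemble, [0.3, 5.0]))  # → [0, 1]
```

In LightGBM's Python API, the equivalent of this C API option is `booster.predict(X, pred_leaf=True)`, which returns one leaf index per tree per sample.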
-
Hi, I ran the CenterFace ONNX model and found that, although faces are detected, the bounding-box sizes differ considerably between the ONNX model and the TensorRT model.
![out](https://user-images.githubusercontent.com/3515…
-
We are using cloudevents/sdk-go (v1.2.0) to forward payloads to our request logger.
Triton server recently (sometime between 20.08 and 21.08) started to respect the `Accept-Encoding` header and now re…
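When a server honors `Accept-Encoding: gzip`, the response body arrives compressed and must be decoded before it can be forwarded downstream. A minimal Python sketch of that consumer-side handling (the function name is illustrative; the project in question actually uses cloudevents/sdk-go, where the Go HTTP client handles this differently):

```python
import gzip

def decode_body(body, content_encoding):
    """Decompress an HTTP body if the server marked it Content-Encoding: gzip."""
    if content_encoding == "gzip":
        return gzip.decompress(body)
    return body

# Simulate a server that honored `Accept-Encoding: gzip`:
payload = b'{"model_name": "detector", "outputs": []}'
compressed = gzip.compress(payload)

print(decode_body(compressed, "gzip") == payload)  # → True
print(decode_body(payload, None) == payload)       # → True
```

Alternatively, a client that cannot handle compressed payloads can send `Accept-Encoding: identity` to ask the server not to compress at all.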
-
Hello, I have run into an issue where a model quantized with AWQ shows more performance degradation than expected.
I know that ModelOpt provides optimized kernels and quantization algorithms for fast quanti…