triton-server Search Results

1000+ results for triton-server


triton-inference-server/server #3984

Batching support by stacking input arrays in python backend

*Is your feature request related to a problem? Please describe.* Triton python backend should provide dynamic batching just like other backends supported by triton. For eg. For the model config ment…

shreypandey updated 1 year ago
2
k2-fsa/sherpa #412

Triton streaming support for old zipformer(pruned stateless …

Hello, I tried to use nvidia triton streaming configuration with pruned stateless 7 streaming model, but it seems that one input is missing to encoder "avg_cache", this seems to be added in new zip…

uni-saurabh-vyas updated 1 year ago
7
MAGICS-LAB/DNABERT_2 #19

CompilationError: at 114:24:

Epoch [1/3] --------------------------------------------------------------------------- KeyError Traceback (most recent call last) File :21, in _fwd_kernel(Q, K, V,…

QAQ1551QAQ updated 4 months ago
24
aws/deep-learning-containers #2599

[feature-request] TensorRT support for PyTorch Serve

Checklist - [x] I've prepended issue tag with type of change: [feature] - [x] (If applicable) I've documented below the DLC image/dockerfile this relates to - [x] (If applicable) I've documented th…

emilwallner updated 6 months ago
3
kubernetes/kubernetes #107928

POD fails to attach correct sriov device on ungraceful node …

### What happened? POD with sriov nic device attached to it fails to attach correct sriov device up on node is hard rebooted after volumes are attached to it. The node is a VM in openstack cloud pr…

rthakur-est updated 5 months ago
15
microsoft/onnxruntime #12044

Using onnxruntime server for model deployment

Is there any way we can save the model with the registered custom ops, so that each time when we load the onnx model we don't have to register the custom ops? Right now every time we load the model, w…

debjyoti003 updated 2 years ago
3
pytorch/pytorch #98707

Ubuntu 22.04 LTS issue <built-in function load_binary> retur…

### 🐛 Describe the bug Greetings, I was directed to this repository as I am encountering an issue with PyTorch. Specifically, I am experiencing an error with loading triton when attempting to ru…

krim404 updated 3 months ago
20
Smorodov/Multitarget-tracker #345

Multi-camera mode

Hey! You have a wonderful project. Tell me, if possible, how to run the example "Calculating the speed of cars using YOLO v4 in real time" and other examples in this repository in multi-camera mode. I…

MichaelBryce90 updated 3 years ago
7
tritonmc/Triton #253

Gradients don't transfer colors to key

### Describe the bug If you create a gradient through MiniMessage and insert a key into it, the first color from the gradient will only be set to the text that came out of the key. ![image](https:…

whereareiam updated 1 year ago
1
triton-inference-server/tensorrtllm_backend #113

When the input contains end_id, the last character of output…

model: baichuan1 13b enable inflight_fused_batching **good case post:** `curl -X POST 10.60.133.200:8030/v2/models/ensemble/generate -d '{"max_tokens": 90, "bad_words": "", "stop_words": "", "t…

PAOPAO6 updated 12 months ago
3

