triton-server Search Results

1000+ results for triton-server


microsoft/onnxruntime #12044

Using onnxruntime server for model deployment

Is there any way we can save the model with the registered custom ops, so that each time when we load the onnx model we don't have to register the custom ops? Right now every time we load the model, w…

debjyoti003 updated 2 years ago
3
triton-inference-server/server #6896

thread control for pytorch backend to fix the issue of PyTor…

**Is your feature request related to a problem? Please describe.** For now the Tensorflow and ONNX backends in Triton support thread controls ([here](https://github.com/triton-inference-server/tens…

yongbinfeng updated 6 months ago
3
triton-inference-server/server #6643

Can ensemble models cache ?

**Description** Caching is not working with ensemble models. **Triton Information** 23.07 **Are you using the Triton container or did you build it yourself?** Triton container **To Reproduce*…

haiminh2001 updated 9 months ago
2
InternLM/xtuner #726

how to train multi tasks on different gpus at the same time?

I have 2 x A100 GPUs. I have been training one task on gpu1, and I want to train another task on gpu2 at the same time, but I get the following error: ``` CUDA_VISIBLE_DEVICES=1 \ xtuner t…

ztfmars updated 3 weeks ago
2
apache/mxnet #20220

Dynamic Batching during Inference / Runtime

First, thanks for creating this great and high performant framework! I've looked in the open and closed issues and couldn't find this one. ## Description It would be really cool to be able to enabl…

andreas-solti updated 3 years ago
1
triton-inference-server/server #6997

Error generating stream: TextEncodeInput must be Union[TextI…

Hi everyone, I am running tritonserver with vLLM and I want to run it with dynamic batching, but I encountered an error. It seems to have something to do with my input. Inference with curl: curl -X POST loca…

thanhtung901 updated 6 months ago
3
nvidia-riva/nemo2riva #36

Conformer CTC converted with nemo2riva 2.13.1 deployed on Ri…

I have a conformer CTC model built with the NeMo framework (https://github.com/NVIDIA/NeMo), which can be normally converted and deployed with Riva 2.11.0. However, if I convert the same NeMo file to …

itzsimpl updated 7 months ago
1
Smorodov/Multitarget-tracker #345

Multi-camera mode

Hey! You have a wonderful project. Tell me, if possible, how to run the example "Calculating the speed of cars using YOLO v4 in real time" and other examples in this repository in multi-camera mode. I…

MichaelBryce90 updated 2 years ago
7
xebd/accel-ppp #154

stack-buffer-underflow in reload_exec

Using version `accel-ppp version 1.12.0-149-gff91c73` The function `reload_exec` can cause `stack-buffer-underflow`: Here is the asan report: ``` ============================================…

GoldBinocle updated 2 years ago
2
autopilotpattern/telegraf #2

MVP+1: RFD27 integration

[RFD27/Container Monitor](https://github.com/joyent/rfd/blob/master/rfd/0027/README.md) integration requires two things: 1. TLS certs based on a user's SSH key 2. Discovery of RFD27 endpoints ### Auth…

misterbisson updated 7 years ago
1
