triton-server Search Results

1000+ results
for triton-server

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

Smorodov/Multitarget-tracker #345

Multi-camera mode

Hey! You have a wonderful project. Tell me, if possible, how to run the example "Calculating the speed of cars using YOLO v4 in real time" and other examples in this repository in multi-camera mode. I…

MichaelBryce90 updated 3 years ago
7
triton-inference-server/tensorrtllm_backend #113

When the input contains end_id, the last character of output…

model: baichuan1 13b enable inflight_fused_batching **good case post:** `curl -X POST 10.60.133.200:8030/v2/models/ensemble/generate -d '{"max_tokens": 90, "bad_words": "", "stop_words": "", "t…

PAOPAO6 updated 12 months ago
3
SkinsRestorer/SkinsRestorer #1286

add Kaiiju support

### Is there an existing issue for this? - [X] I have searched the existing issues ### Are you using forge? No ### Installed conforming to our guide? - [X] I have read the installation guide and …

mani1232 updated 1 year ago
6
microsoft/onnxruntime #12044

Using onnxruntime server for model deployment

Is there any way we can save the model with the registered custom ops, so that each time when we load the onnx model we don't have to register the custom ops? Right now every time we load the model, w…

debjyoti003 updated 2 years ago
3
huggingface/text-generation-inference #2593

Server error: transport error

### System Info We are deploying the model meta-llama/Meta-Llama-3.1-70B-Instruct with FP8 quantization and everything works perfectly for hours until the server crashes with this error: 2024-10-…

ismael-dm updated 5 days ago
2
louisgv/local.ai #107

Notes

CUDA supports: https://github.com/kimlimjustin/xplorer/blob/master/src/Service/app.ts https://github.com/launchbadge/sqlx https://github.com/Jimver/cuda-toolkit https://github.com/LLukas22/llm-r…

louisgv updated 1 year ago
8
ray-project/ray #19425

Support static conda envs using conda-pack

Currently we only support dynamically-installed conda environments, but that is not well-suited for production usage. @jiaodong I think we should make this a requirement for OSS jobs release. ``…

edoakes updated 1 year ago
2
kleveross/ormb #47

[feature] Provider unified offline batch inference interface

**Is this a BUG REPORT or FEATURE REQUEST?**: > Uncomment only one, leave it on its own line: > > /kind bug > /kind feature **What happened**: Investigate if we can use https://github.…

gaocegege updated 4 years ago
6
k2-fsa/sherpa #412

Triton streaming support for old zipformer(pruned stateless …

Hello, I tried to use nvidia triton streaming configuration with pruned stateless 7 streaming model, but it seems that one input is missing to encoder "avg_cache", this seems to be added in new zip…

uni-saurabh-vyas updated 1 year ago
7
pytorch/pytorch #115138

ImportError: /lib64/libstdc++.so.6: version `GLIBCXX_3.4.20'…

### 🐛 Describe the bug cross-posting from https://github.com/VKCOM/YouTokenToMe/issues/113 (since I'm not sure if it belongs in pytorch or YouTokenToMe): reduced from a more complex example: ``…

timotheecour updated 7 months ago
1

上一页 1...94 95 96 97 98 99 100...100 下一页

1000+ results for triton-server

1000+ results
for triton-server