-
Thanks for your excellent work.
I see that an ONNX model (for example, ViT converted to ONNX) has a lot of potential if it can run inference on batched inputs, since batching reduces overall time and boosts throughput.
N…
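For context, a minimal sketch of what batched inference against a ViT ONNX model could look like with onnxruntime, assuming the model was exported with a dynamic batch axis; the file name `vit.onnx` and the input name `pixel_values` are assumptions for illustration, not details from this thread:

```python
import numpy as np
import onnxruntime as ort

# Assumes the export used dynamic_axes={"pixel_values": {0: "batch"}},
# so the first dimension accepts any batch size.
session = ort.InferenceSession("vit.onnx", providers=["CPUExecutionProvider"])

# Stack N preprocessed images into a single (N, 3, 224, 224) tensor.
batch = np.stack([np.random.rand(3, 224, 224).astype(np.float32) for _ in range(8)])

# One session.run call scores the whole batch, amortizing per-call overhead.
(logits,) = session.run(None, {"pixel_values": batch})
print(logits.shape)  # (8, num_classes)
```

If the exported graph has a fixed batch dimension, it would need to be re-exported with a dynamic axis before this works.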
-
It would be nice to have batch inference support similar to [`mlx_parallm`](https://github.com/willccbb/mlx_parallm); I'm happy to try adding it soon. @Blaizzy can you assign this to me?
-
- [x] Use `llama_decode` instead of deprecated `llama_eval` in `Llama` class
- [ ] Implement batched inference support for `generate` and `create_completion` methods in `Llama` class (see the `llama_decode` sketch after this list)
- [ ] Add suppo…
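For reference, a sketch of how multiple sequences might be fed through one `llama_decode` call via the low-level `llama_cpp` bindings. This is illustrative only: `decode_two_prompts` is a hypothetical helper, and the exact `llama_batch` field layout has shifted across llama.cpp versions.

```python
import llama_cpp

def decode_two_prompts(ctx, tokens_a, tokens_b):
    """Hypothetical helper: decode two prompts in a single llama_decode call."""
    n_total = len(tokens_a) + len(tokens_b)
    batch = llama_cpp.llama_batch_init(n_total, 0, 1)  # 1 seq id per token
    i = 0
    for seq_id, tokens in ((0, tokens_a), (1, tokens_b)):
        for pos, tok in enumerate(tokens):
            batch.token[i] = tok
            batch.pos[i] = pos
            batch.n_seq_id[i] = 1
            batch.seq_id[i][0] = seq_id               # tag token with its sequence
            batch.logits[i] = pos == len(tokens) - 1  # logits only for last token
            i += 1
    batch.n_tokens = n_total
    llama_cpp.llama_decode(ctx, batch)  # one call evaluates both sequences
    # Per-sequence logits can then be read with llama_get_logits_ith(...)
    llama_cpp.llama_batch_free(batch)
```

Unlike the deprecated `llama_eval`, which processed one contiguous sequence per call, `llama_batch` carries a sequence id per token, which is what makes multi-sequence (batched) decoding possible.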
-
Thanks for the conversion code for phi3-vision.
I'm building an app that serves concurrent requests and needs continuous batching. Can I run inference on phi3-vision with a batch size larger than 1 (I mean in onnx mode…
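For illustration, a simplified dynamic micro-batching loop (a much-reduced cousin of true continuous batching, which interleaves work at the token level): requests that arrive within a short window are grouped and run through the model together. `run_model` is a hypothetical stand-in for one batched forward pass, e.g. a single onnxruntime `session.run` call; none of these names come from the phi3-vision code.

```python
import queue
import threading
import time

requests = queue.Queue()  # holds (input, callback) pairs

def run_model(batch_inputs):
    # Placeholder: stack inputs and execute one batched forward pass.
    return [f"result for {x}" for x in batch_inputs]

def batching_loop(max_batch=8, wait_s=0.01):
    while True:
        batch = [requests.get()]              # block until one request arrives
        try:
            while len(batch) < max_batch:     # drain whatever arrived meanwhile
                batch.append(requests.get(timeout=wait_s))
        except queue.Empty:
            pass
        outputs = run_model([inp for inp, _ in batch])
        for (_, callback), out in zip(batch, outputs):
            callback(out)                     # hand each result back to its caller

threading.Thread(target=batching_loop, daemon=True).start()
requests.put(("image-1", print))  # example: enqueue a request, print its result
time.sleep(0.5)                   # give the daemon thread time to process it
```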
-
### System Info
x86-64
4 A10
0.9.0
### Who can help?
_No response_
### Information
- [X] The official example scripts
- [ ] My own modified scripts
### Tasks
- [ ] An officially supported tas…
-
Hi there! Great work!
Is it possible to run a batched inference?
Thanks!
-
I wonder whether batch inference is supported. I read the eval code, and it seems it only evaluates one video at a time.
-
**Motivation:** Batching multiple inference requests together can speed up inference. Batching can even be leveraged in single-input settings for speedups via, e.g., staged speculative decoding.
*…
-
I would like to perform batch inference. Can you please point me to some resources or provide support for it? Thanks a lot!
-
So we're having trouble running inference efficiently at scale, and of course we're processing the audio parts one by one, as is the default for inference, but is there any support for batch inference to speed th…