-
How do I perform batch inference with swift? It isn't mentioned anywhere in the docs, and I cannot find it in the code either.
-
Hi, is it possible to batch inference requests the way LLM servers do? For example, could I provide 10 transcripts and batch the requests to increase total throughput?
-
Hi!
I'm evaluating the model on a relatively large dataset (single question, single answer). I was able to fine-tune the Bunny-1.1-Llama-3-8B-V model using one of the scripts provided. What is the …
-
Hi authors,
I want to test the performance of Mistral7B on the test dataset. Is it only possible to do single-sample inference (with model.generate(...))? Are there any methods to accelerate t…
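Until a library ships native batch support, callers can often recover most of the throughput win by batching on their side: chunk the evaluation set, left-pad each chunk of token-id sequences to a common length, and pass the whole batch to the model in one call. A minimal, library-agnostic sketch of those two pieces (the helper names `batched` and `pad_batch` and the pad id are illustrative, not any framework's API):

```python
def batched(items, batch_size):
    """Yield successive chunks of `items`, each of length <= batch_size."""
    for i in range(0, len(items), batch_size):
        yield items[i:i + batch_size]


def pad_batch(sequences, pad_id=0):
    """Left-pad variable-length token-id lists to the batch's max length.

    Left padding keeps each sequence's last real token at the final
    position, which is where decoder-only models continue generation.
    Returns the padded ids and a 0/1 attention mask of the same shape.
    """
    max_len = max(len(seq) for seq in sequences)
    padded = [[pad_id] * (max_len - len(seq)) + seq for seq in sequences]
    mask = [[0] * (max_len - len(seq)) + [1] * len(seq) for seq in sequences]
    return padded, mask


if __name__ == "__main__":
    prompts = [[5, 6], [1, 2, 3, 4], [9]]
    for chunk in batched(prompts, 2):
        ids, mask = pad_batch(chunk)
        # Each (ids, mask) pair is rectangular and ready to stack
        # into a single tensor for one forward/generate call.
        print(ids, mask)
```

The same shape of code works whether the actual call is a Hugging Face-style `model.generate`, an MLX loop, or a llama.cpp batch: the framework-specific part is only the final model call.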
-
Great job! Are there any tips on setting up bounding boxes to perform batch inference?
-
Would be nice to have batch inference support similar to [`mlx_parallm`](https://github.com/willccbb/mlx_parallm), happy to try and add soon. @Blaizzy can you assign this to me?
-
### System Info
x86-64
4 A10
0.9.0
### Who can help?
_No response_
### Information
- [X] The official example scripts
- [ ] My own modified scripts
### Tasks
- [ ] An officially supported tas…
-
- [x] Use `llama_decode` instead of deprecated `llama_eval` in `Llama` class
- [ ] Implement batched inference support for `generate` and `create_completion` methods in `Llama` class
- [ ] Add suppo…
-
Hi there! Great work!
Is it possible to run batched inference?
Thanks!
-
I wonder, does this support batch inference?
I read the eval code, and it seems each run only evaluates a single video.