-
The error message is as follows:
run.pl: job failed, log is in /mnt/d/Work/FunCodec/egs/LibriTTS/text2speech_laura/dump/libritts/test-other/codecs//logdir/inference.1.log
cat: '/mnt/d/Work/FunCodec/egs/LibriTTS/text2speec…
-
Hello!
I’ve been working with TensorFlow Lite models in tract and have come across a model that cannot be loaded. I've compared the operators used in the model with those mentioned in the [README](…
-
We noticed that `lm_eval --model vllm` did not work when `data_parallel_size > 1` and got `Error: No available node types can fulfill resource request` from Ray. After some research, I believe when `tenso…
-
Any plan to support the latest Qwen2-VL model evaluation?
-
I am heavily using the `tf.contrib.data` datasets API for image-based tasks. With the observations for images (LSUN/celebA etc.) being no more than a downloader for these datasets, would it be worthwhile to r…
-
I saw that in your config files you use batch sizes of 100, 200, 128, and 256. Does this affect how I use this model to do inference? Do I have to pad my image data, e.g., 1 image to 100 images, in ord…
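As a general point (a minimal numpy sketch, not the repository's actual model): the training batch size is a data-pipeline setting, not part of the model's weights, so inference can usually run with any batch size, padding not required. The names below are purely illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)

# Weights of a toy dense layer; note their shapes never mention a batch size.
W = rng.standard_normal((784, 10))
b = np.zeros(10)

def forward(x):
    # x: (batch, 784) for ANY batch size -- the same weights are shared
    # across all rows of the batch.
    return x @ W + b

# "Trained" with batch 128, but a single image works without padding:
print(forward(rng.standard_normal((128, 784))).shape)  # (128, 10)
print(forward(rng.standard_normal((1, 784))).shape)    # (1, 10)
```

The exception is models whose graphs were exported with a hard-coded batch dimension, in which case the input must match that fixed shape.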
-
(mitbevfusion) gss@Gss:~/Lidar_AI_Solution/CUDA-BEVFusion$ python qat/export-scn.py
Tracing model inference...
> Do inference...
--> SparseConvolutionQunat0[subm] -> Input 0, Output 1
Tracebac…
-
**Description**
Triton does not clear or release GPU memory when there is a pause in inference. In the attached diagrams the same model is being used. It is served via ONNX.
![image (1)](https:…
-
This is more of a question for my understanding. I understand that at training time each sequence is of fixed length (and not padded), so the attention mask can be constructed using a triangular matrix…
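For reference, a minimal numpy sketch of the triangular (causal) mask described above, plus the usual way it is combined with a padding mask once sequences in a batch have different lengths. The function names here are illustrative, not from any particular codebase.

```python
import numpy as np

def causal_mask(seq_len):
    # Lower-triangular boolean matrix: position i may attend to j <= i.
    return np.tril(np.ones((seq_len, seq_len), dtype=bool))

def causal_padding_mask(lengths, seq_len):
    # For variable-length sequences, AND the triangular mask with a
    # per-sequence padding mask; result shape: (batch, seq_len, seq_len).
    tri = causal_mask(seq_len)[None, :, :]
    valid = np.arange(seq_len)[None, :] < np.asarray(lengths)[:, None]
    return tri & valid[:, None, :]

print(causal_mask(3))
# [[ True False False]
#  [ True  True False]
#  [ True  True  True]]
```

With fixed-length, unpadded training sequences only the triangular part is needed; the padding term matters once batched inference mixes lengths.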
-
### Description
The inference API supports client side batching by leveraging the `input` array field. External services implement different limits for batched requests. Cohere limits the text to [96…
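One way a client could cope with such provider-side limits is to chunk the `input` array before sending. A hedged sketch (the 96-element limit mirrors the Cohere example above; `chunked` and `batch_size` are illustrative names, not an existing API):

```python
from typing import Iterable, List

def chunked(items: List[str], batch_size: int) -> Iterable[List[str]]:
    # Yield consecutive slices of at most `batch_size` elements,
    # preserving order so results can be re-concatenated.
    for start in range(0, len(items), batch_size):
        yield items[start:start + batch_size]

texts = [f"doc-{i}" for i in range(200)]
batches = list(chunked(texts, 96))
print([len(b) for b in batches])  # [96, 96, 8]
```

Each batch would then be sent as one request and the per-batch results concatenated in order.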