-
### System Info
```shell
Optimum: latest from GitHub
Python: 3.8
Platform: NVIDIA V100
```
### Who can help?
@JingyaHuang
### Information
- [X] The official example scripts
- […
-
### 🐛 Describe the bug
torchbench_amp_bf16_inference
- [ ] `sam_fast`
```shell
Traceback (most recent call last):
  File "/home/sdp/actions-runner/_work/torch-xpu-ops/pytorch/benchmarks/dynamo/common.p…
```
-
### 🚀 The feature
## Author: Li Ning
## Background
A stateful model can capture dependencies between successive inference requests. This type of model maintains a persist…
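As a rough illustration of the idea above, here is a minimal sketch of a stateful inference wrapper in plain Python/NumPy. The class, method names, and toy recurrence are all illustrative assumptions, not the API of any particular framework:

```python
import numpy as np

class StatefulModel:
    """Toy stateful model: output depends on earlier requests via self.state."""

    def __init__(self, hidden_size: int = 4):
        # This state persists between inference requests.
        self.state = np.zeros(hidden_size)

    def infer(self, request: np.ndarray) -> np.ndarray:
        # Toy recurrence: the new output mixes the incoming request with
        # the state accumulated from all previous requests.
        self.state = 0.5 * self.state + request
        return self.state.copy()

    def reset(self) -> None:
        # A stateful model typically exposes a way to start a new sequence.
        self.state = np.zeros_like(self.state)

m = StatefulModel()
out1 = m.infer(np.ones(4))  # first request: state was zero
out2 = m.infer(np.ones(4))  # same input, different output: state persisted
```

Feeding the identical input twice yields different outputs, which is exactly the cross-request dependency that distinguishes a stateful model from a stateless one.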
-
What I have experienced is that C++ inference on the CPU is far slower than the latest [diffusers](https://github.com/huggingface/diffusers). In particular, the sampling in the UNet alone takes ab…
-
Just a handy issue for being notified of the latest changes and micro-releases (we will mostly be changing the models).
-
![image](https://user-images.githubusercontent.com/49277976/63659032-ff893400-c7e9-11e9-9207-f330ac7db6e2.png)
The command that I used: `python main.py --config-file configs/sgg_res101_joint.yaml --i…`
-
I have run the command `python -m torch.distributed.launch --nproc_per_node=1 test_net.py`. In the config file, I used the downloaded "model_finetune.pth" and tested it on the icdar_test dataset. The outpu…
-
Hi, thank you for open-sourcing this project. I have a few questions about inference with quantized models.
(1) If the model uses only W8A8 quantization but the KV cache is not quantized, whether the fo…
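For context on what W8A8 means here, below is a hedged sketch of symmetric per-tensor int8 quantization of both weights and activations, with the matmul accumulated in int32 and dequantized afterward. The scale and rounding scheme is generic and illustrative, not the one used by any specific library:

```python
import numpy as np

def quantize_sym(x: np.ndarray):
    """Symmetric per-tensor int8 quantization: x ≈ q * scale."""
    scale = np.abs(x).max() / 127.0
    q = np.clip(np.round(x / scale), -127, 127).astype(np.int8)
    return q, scale

rng = np.random.default_rng(0)
w = rng.standard_normal((8, 8)).astype(np.float32)  # weights (quantized: W8)
a = rng.standard_normal((4, 8)).astype(np.float32)  # activations (quantized: A8)

qw, sw = quantize_sym(w)
qa, sa = quantize_sym(a)

# int8 x int8 matmul with int32 accumulation, then dequantize the result.
y_quant = (qa.astype(np.int32) @ qw.astype(np.int32).T) * (sa * sw)
y_fp32 = a @ w.T  # full-precision reference

max_err = np.abs(y_quant - y_fp32).max()
```

In a W8A8-only setup like the one asked about, the KV cache would simply be stored in floating point (the dequantized attention outputs), so only the linear-layer inputs and weights go through a path like this.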
-
### 🚀 The feature, motivation and pitch
## Background
Many existing Large Language Models (LLMs) use FP16 during inference to improve performance. Downstream inference libraries, such as vllm, r…
-
After running the script below:
```shell
python3 deploy/python/infer.py \
    --model_dir=output_inference/picodet_lcnet_x1_0_layout/ \
    --image_file=./docs/images/layout.jpg \
    --device=CPU
```
Error message:
```shell
batch_size: 1
…
```