-
### OpenVINO Version
2024.4.0-16579-c3152d32c9c-releases/2024/4
### Operating System
Other (Please specify in description)
### Device used for inference
NPU
### Framework
None
### Model used
…
-
We need to verify that all the changes in the external forks of the MLCommons repos used in our past SCC'22 and SCC'23 tutorials have been merged into the MLCommons mainline repositories.
We can then update these tu…
-
I have a few questions about the inference efficiency of DeepSeek-V2.
1.
> In order to efficiently deploy DeepSeek-V2 for service, we first convert its parameters into the precision of FP8.
Ar…
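For context, the FP8 conversion the quote describes can be sketched with a simple per-tensor absmax scaling scheme. This is a hypothetical illustration of the general technique, not DeepSeek-V2's actual quantization recipe; the E4M3 max value and the helper names are assumptions for the sketch.

```python
import numpy as np

# Largest finite value representable in the FP8 E4M3 format.
FP8_E4M3_MAX = 448.0

def quantize_fp8(w: np.ndarray):
    """Scale weights into the FP8 range; keep the scale for dequantization.

    Hypothetical per-tensor absmax scheme -- real kernels would cast the
    scaled values to an 8-bit float type here; we keep float32 so the
    sketch only shows the scaling logic.
    """
    scale = float(np.abs(w).max()) / FP8_E4M3_MAX
    q = np.clip(w / scale, -FP8_E4M3_MAX, FP8_E4M3_MAX)
    return q, scale

def dequantize_fp8(q: np.ndarray, scale: float) -> np.ndarray:
    """Recover approximate full-precision weights."""
    return q * scale

w = np.array([0.5, -1.2, 3.0], dtype=np.float32)
q, s = quantize_fp8(w)
w_back = dequantize_fp8(q, s)
```

Because this sketch skips the actual 8-bit cast, the round-trip is lossless; in a real deployment the cast introduces the quantization error that FP8 serving trades for memory and bandwidth savings.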
-
Cool model! I'll give it a try.
I'd like to know the minimal hardware requirement for 5 tokens/s.
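A rough way to frame that question: autoregressive decoding is usually memory-bandwidth bound, so tokens/s is roughly memory bandwidth divided by the bytes read per token. The sketch below is a back-of-envelope estimate under that assumption; the 21B active-parameter count (DeepSeek-V2's MoE activates only a subset of its weights per token) and 1 byte/param (FP8) are assumptions, and KV-cache traffic is ignored.

```python
def min_bandwidth_gb_s(active_params_b: float,
                       bytes_per_param: float,
                       target_tok_s: float) -> float:
    """Estimate the memory bandwidth (GB/s) needed to hit a decode rate,
    assuming decode is memory-bound and every active weight is read once
    per generated token."""
    bytes_per_token = active_params_b * 1e9 * bytes_per_param
    return bytes_per_token * target_tok_s / 1e9

# Assumed: ~21B active params at FP8 (1 byte/param), 5 tok/s target.
needed = min_bandwidth_gb_s(21, 1.0, 5)  # 105.0 GB/s
```

By this crude bound, any device that can hold the weights and sustain on the order of 100+ GB/s of effective bandwidth could reach ~5 tok/s; real throughput will be lower once KV-cache reads and kernel overheads are counted.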
-
I hope everyone who visits this issue can share their system, CPU, GPU, inference speed in the GPT stage, and the version used (ideally with v2 as a baseline for comparison),
so that we can see how different me…
-
Some edge systems may not be connected to the internet, so we need a way to run MLPerf inference benchmarks on them using CM.
-
### Your current environment
Offline inference of Llama-3-8B with benchmark_latency.py, sweeping over 1, 2, and 4 cards, gives these results:
And the optimum-habana results:
The results show that on 1 card…
-
Tracker issue for the educational project
-
Checked https://docs.mlcommons.org/inference/benchmarks but found:
@ashwin @jdduke @codyaustun @badenh @koichishirahata
-
From https://github.com/pytorch/pytorch/pull/134282#issuecomment-2307157197: in the aarch64 dashboard results, benchmarking with fp16 is 2x~10x slower than bf16, often causing timeouts in some cases.…