-
### Describe the issue
Hey
We are planning to add GPU inference (using Microsoft.ML.OnnxRuntime.Gpu 1.17.0) as an option in our C# software.
However, when switching from the CPU ONNX runtime to th…
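Since the snippet is cut off, here is a minimal sketch of the provider-fallback logic this kind of switch usually needs, written in Python with the `onnxruntime` package's documented provider names (`pick_providers` is a hypothetical helper, not a library function; the C# package exposes analogous CUDA session options):

```python
# Hedged sketch: choose ONNX Runtime execution providers with a CPU fallback.
# The provider names are the ones onnxruntime documents; pick_providers itself
# is a hypothetical helper, not part of any library.
PREFERRED = ["CUDAExecutionProvider", "CPUExecutionProvider"]

def pick_providers(available):
    """Return the preferred providers that are actually available, in order."""
    chosen = [p for p in PREFERRED if p in available]
    return chosen or ["CPUExecutionProvider"]

# Usage with the real library would look like this (not executed here):
#   import onnxruntime as ort
#   providers = pick_providers(ort.get_available_providers())
#   session = ort.InferenceSession("model.onnx", providers=providers)
```

Listing the CPU provider after the CUDA one lets the session fall back per-node when a kernel has no GPU implementation, which is often where CPU-vs-GPU behavior differences first show up.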
-
## ❓ Questions and Help
```
Traceback (most recent call last):
  File "webcam.py", line 80, in <module>
    main()
  File "webcam.py", line 71, in main
    composite = coco_demo.run_on_opencv_image(img)
  F…
```
-
https://www.modelscope.cn/models/qwen/Qwen-1_8B-Chat/summary
Running the model above on an A770 GPU gives the following data:
- 32 in / 32 out: peak GPU mem 3.1 GB
- 2048 in / 512 out: peak GPU mem 7.4 GB
- 4096 in / 1024 out: peak GPU mem 11.6 GB
- 8192 in / 2048 o…
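The numbers above grow roughly linearly with total tokens. A hedged back-of-the-envelope check (the Qwen-1.8B hyperparameters below, 24 layers and hidden size 2048 in fp16, are assumptions from the model card, not from this thread) suggests the fp16 KV cache alone explains only a small share of that growth:

```python
# Hedged back-of-the-envelope: compare observed peak-memory growth per token
# against the fp16 KV-cache cost per token. The 24-layer / hidden-2048
# hyperparameters are assumed, not taken from this thread.
def kv_bytes_per_token(n_layers=24, hidden=2048, dtype_bytes=2):
    return 2 * n_layers * hidden * dtype_bytes  # one K and one V row per layer

# (total tokens, observed peak bytes) from the measurements above
obs = [(32 + 32, 3.1e9), (2048 + 512, 7.4e9), (4096 + 1024, 11.6e9)]
(t0, m0), (t1, m1) = obs[0], obs[-1]
growth_per_token = (m1 - m0) / (t1 - t0)

print(f"observed growth: {growth_per_token / 1e6:.2f} MB/token")
print(f"fp16 KV cache:   {kv_bytes_per_token() / 1e6:.2f} MB/token")
```

Under these assumptions the observed ~1.7 MB/token is several times the ~0.2 MB/token a plain fp16 KV cache would need, so most of the peak usage would come from other buffers (e.g. attention scratch space or logits), which may be worth investigating.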
-
[benchmarks.zip](https://github.com/CVC4/CVC4/files/6325239/benchmarks.zip)
[Statistics.csv](https://github.com/CVC4/CVC4/files/6325242/Statistics.csv)
Hi. Recently, I collected the benchmarks w…
-
Hi all, I'm new to xformers and I'm working through the `examples/llama_inference/generate.py` file.
I traced it to here:
```python
def _memory_efficient_attention_forward(
inp: Inputs, op: Optional[Type…
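# The function body is truncated above. As a hedged, pure-Python illustration
# (the names below are mine, not from xformers): memory-efficient attention
# avoids materializing the full attention matrix by scanning keys/values in
# chunks with an "online" softmax that rescales a running max, normalizer,
# and weighted sum. For a single query vector:
import math

def naive_attention(q, ks, vs):
    # Reference: full softmax over all scores, then a weighted sum of values.
    scores = [sum(qi * ki for qi, ki in zip(q, k)) for k in ks]
    m = max(scores)
    w = [math.exp(s - m) for s in scores]
    z = sum(w)
    dim = len(vs[0])
    return [sum(w[j] * vs[j][d] for j in range(len(vs))) / z for d in range(dim)]

def chunked_attention(q, ks, vs, chunk=2):
    # Only `chunk` scores exist in memory at a time; the running max,
    # normalizer z, and accumulator are rescaled when a larger max appears.
    m, z, acc = float("-inf"), 0.0, [0.0] * len(vs[0])
    for start in range(0, len(ks), chunk):
        scores = [sum(qi * ki for qi, ki in zip(q, k))
                  for k in ks[start:start + chunk]]
        new_m = max(m, max(scores))
        scale = math.exp(m - new_m) if m != float("-inf") else 0.0
        z *= scale
        acc = [a * scale for a in acc]
        for s, v in zip(scores, vs[start:start + chunk]):
            w = math.exp(s - new_m)
            z += w
            acc = [a + w * vi for a, vi in zip(acc, v)]
        m = new_m
    return [a / z for a in acc]

# chunked_attention agrees with naive_attention up to floating-point error,
# without ever holding all len(ks) scores at once.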
-
-
@regisss would it make sense to add task-specific evaluators? For example, for `automatic-speech-recognition`: I did that manually when I ran Whisper's benchmark.
-
This PR made the `alloc.strings` benchmark in BaseBenchmarks 2000% slower in min wall time, increased memory use by 42.25%, and increased allocations by 63.53%.
It also worsened compile performance with `i…
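For anyone reproducing such a regression locally, a minimal sketch of a min-wall-time micro-benchmark using only the standard library (`alloc_strings` below is a hypothetical stand-in for the real benchmark body, not the BaseBenchmarks code):

```python
import timeit

def alloc_strings(n=1000):
    # Hypothetical stand-in for the benchmark body: allocate many small strings.
    return ["x" * (i % 32) for i in range(n)]

# Taking min() over several repeats approximates the noise-free lower bound,
# which is what "min wall time" reports compare.
times = timeit.repeat(alloc_strings, number=100, repeat=5)
print(f"min wall time: {min(times):.6f} s")
```

Comparing the min before and after a change is more robust than comparing means, since the minimum is least affected by scheduler and GC noise.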
-
My configuration is as follows:
- Arch Linux, fully up to date; NVIDIA drivers installed and configured correctly; CUDA installed and configured correctly; the works
- Podman image build using a c…
-
While testing #1129 on the ADS side on `bireli`, we found this weird behavior. Downgrading CUDA from 11.1 to 10.2 speeds up inference (almost twice as fast).
`bireli` has a [GeForce GTX TITAN X](ht…