-
A number of Preview systems in MLPerf Inference v4.0 used fewer cards than would be typical in production, owing to limited card availability at the time. Rather than benchmarking the systems with …
-
We're exploring various optimizations available in the [Diffusers library](https://huggingface.co/docs/diffusers/main/en/optimization/opt_overview) to enhance VRAM usage and inference speed. @titan-no…
-
Traceback (most recent call last):
  File ".//src/benchmark_evaluation/bbq_eval.py", line 28, in <module>
from decoding_algorithm import Inference
File "/sea-llm/src/decoding_algorithm/__init__.py", …
-
Hi @JUGGHM, Thank you all for your great work with MMDE.
I would like to know whether you have any inference time or speed data available for any GPUs, or any related benchmarks.
Best Regard…
-
Hi, when running example inference on Mamba2:
```
python benchmarks/benchmark_generation_mamba_simple.py --model-name "state-spaces/mamba2-2.7b" --prompt "My cat wrote all this CUDA code for a new …
-
### Is there an existing issue for this bug?
- [X] I have searched the existing issues
### 🐛 Describe the bug
Got `TypeError: LlamaInferenceForwards.llama_causal_lm_forward() got an unexpected keyw…
-
### 🐛 Describe the bug
We are planning to upgrade our Python environment from 3.8 to 3.10, because PyTorch recently deprecated Python 3.8.
But we found that there are performance gaps between pyt…
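When chasing interpreter-version performance gaps like this, it helps to isolate a pure-Python workload first, so PyTorch changes are not conflated with CPython changes. Below is a minimal, stdlib-only sketch (the naive matrix multiply is a hypothetical workload chosen for illustration, not taken from the issue): run the same script under both interpreters and compare the printed timings.

```python
import sys
import timeit


def matmul_naive(a, b):
    """Naive pure-Python matrix multiply; interpreter-bound, so it is
    sensitive to CPython version differences rather than library changes."""
    n, k, m = len(a), len(b), len(b[0])
    return [[sum(a[i][x] * b[x][j] for x in range(k)) for j in range(m)]
            for i in range(n)]


def bench(size=40, repeats=5):
    """Return the best-of-N wall time for the workload (seconds)."""
    a = [[float(i + j) for j in range(size)] for i in range(size)]
    b = [[float(i - j) for j in range(size)] for i in range(size)]
    # min() of timeit.repeat gives the least-noisy estimate.
    return min(timeit.repeat(lambda: matmul_naive(a, b), number=3, repeat=repeats))


if __name__ == "__main__":
    print(f"python {sys.version_info.major}.{sys.version_info.minor}: {bench():.4f}s")
```

If the pure-Python numbers are comparable across 3.8 and 3.10, the gap is more likely in the PyTorch builds (different wheels, compilers, or bundled libraries) than in the interpreter itself.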
-
![1725962698758](https://github.com/user-attachments/assets/d18ac98f-7fe9-430c-9377-02529d957823)
-
### Problem Description
Seeing a GPU fault when running the onnxruntime-inference-examples script with reduced-layer BERT models during benchmarking.
It appears the quantization/calibration steps work …
-
Current list of tasks:
- [x] threads > 1 do not work
- [x] batches > 1 do not work
- [x] check object detection task on any model to test TVM integration
- [x] detect TVM version via CK package …