-
What is the time taken to process a 3/5/10s video on an A100/H100 GPU?
It would be great to know about any of your existing benchmarks!
-
I tried to replicate the results from your benchmarks using Docker with GPU support and the images nvidia/cuda:12.6.1-cudnn-devel-ubuntu22.04 and nvidia/cuda:12.1.0-cudnn8-devel-ubuntu22.04. After I instal…
-
We are benchmarking Triton with different backends, but we are unable to find the metric to calculate the latency of each request (let's assume each request has a batch size of `b`).
1. Is request la…
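Not an answer from the Triton side, but for reference: per-request latency is usually measured as wall-clock time from issuing the request to receiving the response, and per-sample latency as that value divided by `b`. A minimal sketch, assuming a blocking client call (the `run_request` callable is a stand-in for a real Triton client `infer()`; all names here are illustrative):

```python
import time
import statistics

def measure_latency(run_request, n_requests=100, batch_size=8):
    """Time each request end to end; run_request stands in for a real,
    blocking inference call (e.g. a tritonclient infer())."""
    latencies = []
    for _ in range(n_requests):
        start = time.perf_counter()
        run_request(batch_size)  # blocking inference call for one batch
        latencies.append(time.perf_counter() - start)
    lat_sorted = sorted(latencies)
    return {
        "mean_s": statistics.mean(latencies),
        "p50_s": lat_sorted[len(lat_sorted) // 2],
        "p99_s": lat_sorted[max(int(len(lat_sorted) * 0.99) - 1, 0)],
        # per-sample latency: request latency divided by the batch size b
        "per_sample_mean_s": statistics.mean(latencies) / batch_size,
    }

# Example with a dummy request that sleeps ~1 ms per call
stats = measure_latency(lambda b: time.sleep(0.001),
                        n_requests=20, batch_size=4)
print(stats)
```

Note this measures client-observed latency, which includes network and queuing time; Triton's own metrics endpoint breaks latency into queue/compute components separately.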
-
Just wanted to point this out: occasionally you get the following error when using the inference benchmark:
```
2024-09-17 07:56 INFO User selected random dataset. Generating prompt and output l…
-
Hi, how can I run a text-only benchmark in this inference framework?
-
### Community Note
* Please vote on this issue by adding a 👍 [reaction](https://blog.github.com/2016-03-10-add-reactions-to-pull-requests-issues-and-comments/) to the original issue to help the…
-
https://docs.mlcommons.org/inference/benchmarks/text_to_image/reproducibility/scc24
-
ERROR: [Torch-TensorRT] - Unsupported operator: aten::to.dtype_layout(Tensor(a) self, *, ScalarType? dtype=None, Layout? layout=None, Device? device=None, bool? pin_memory=None, bool non_blocking=Fals…
-
Hey, if you are open to this feature, I can add an ONNX inference benchmark with the CUDA execution provider.
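As a rough sketch of what that benchmark could look like: a warm-up phase (important on CUDA, where early runs include kernel compilation and allocations) followed by timed iterations. The ONNX Runtime usage is kept in a comment so the timing skeleton runs anywhere; the model path and input name in that comment are assumptions:

```python
import time

def bench(infer, warmup=10, iters=50):
    """Generic benchmark loop: discard warm-up iterations, then average
    the timed ones. `infer` stands in for a real call such as:

        sess = onnxruntime.InferenceSession(
            "model.onnx", providers=["CUDAExecutionProvider"])
        infer = lambda: sess.run(None, {"input": batch})
    """
    for _ in range(warmup):
        infer()
    start = time.perf_counter()
    for _ in range(iters):
        infer()
    return (time.perf_counter() - start) / iters  # mean seconds per call

mean_s = bench(lambda: sum(range(1000)))  # dummy workload
print(f"{mean_s * 1e6:.1f} us per call")
```

Since `sess.run()` blocks until outputs are copied back to the host, no explicit device synchronization is needed around the timer, unlike with IO binding or raw CUDA streams.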
-
Hello, I have set up an inference platform with more than 100 GPUs that provides inference services for prevalent LLMs. I want to join this benchmark; how can I do it?