inference-benchmark Search Results

1000+ results
for inference-benchmark

turboderp/exllama #192

Exllama tutorials?

I'm new to exllama, are there any tutorials on how to use this? I'm trying this with the llama-2 70b model.

NickDatLe updated 1 year ago
23
tensorflow/tfjs #6733

[Optimization target] The inference time of DeepLabV3's city…

DeepLabV3 with Cityscapes takes a long time to execute; we need to figure out the reason and then add it.

Linchenn updated 1 year ago
4
pytorch/pytorch #106614

Case study of torch.compile / cpp inductor on CPU: min_sum /…

### 🐛 Describe the bug (I'll add actual benchmarking details and logs and output_code.py in a bit) I'm doing min_sum and mul_sum in two setups: 1. (D, ) x (D, ) -> scalar 2. (B, N, 1, D) x (B,…

vadimkantorov updated 7 months ago
17
NVlabs/VILA #134

About the inference on video.

./scripts/v1_5/eval/video_chatgpt/run_benchmark_1_correctness.sh ==> output: python3: can't open file '/data1/trinh/code/ViLa/image_text/VILA_0820/llava/eval/video/run_inference_benchmark_general.p…

trinhvg updated 1 month ago
1
godotengine/godot-benchmarks #11

[TRACKER] Benchmarks to create

⚠️⚠️**NOTE**⚠️⚠️ **This list is outdated**, please refer to the following one instead: https://github.com/godotengine/godot-benchmarks/issues/36 ⚠️⚠️**END OF NOTE**⚠️⚠️ This is a list of benc…

Calinou updated 5 months ago
11
tenstorrent/tt-metal #8049

[Falcon40b] Prefill perf burndown

Tracks the open issues for Falcon40b prefill to hit target perf. Last updated: May 27th # Prefill *bfp8* - Measured May 22nd, main - 1 GHz - Perf measurements based on 1 layer perf and ext…

johanna-rock-tt updated 4 months ago
7
yjh0410/yolov2-yolov3_PyTorch #57

About training yolov3 gradient explode problem

![image](https://user-images.githubusercontent.com/78162914/187905422-720f1c81-ec46-4a99-91e3-4592a25f400a.png) When I trained YOLOV3 at Batch Size = 4, ‘nan’ appeared. After Debug, I found the **inf…

kill2013110 updated 2 years ago
5
Tencent/PocketFlow #16

Supported operations of PocketFlow

How many operations are supported under the framework of PocketFlow? I didn't find any docs listing the ops available. By the way, the acceleration ratio according to the performance of mobilenet V1…

lanyastar updated 5 years ago
4
ctuning/ck-mlperf #59

RNN-T CmdGen improvements

The [RNN-T CmdGen](https://github.com/ctuning/ck-mlperf/tree/master/cmdgen/benchmark.speech-recognition-loadgen/.cm) is work-in-progress. We started it for the v0.7 submission round, but eventually di…

psyhtest updated 3 years ago
1
facebookarchive/models #66

Error when loading MaskRCNN2Go

When I run the `run_eval.sh` in the `mask_rcnn_2go`, the error message shows: ``` Traceback (most recent call last): File "code/eval_seg_cpu.py", line 193, in main() File "code/eval_seg_…

hiankun updated 4 years ago
5
