-
I'm new to exllama, are there any tutorials on how to use this? I'm trying this with the llama-2 70b model.
-
DeepLabV3 with cityscapes takes long time to execute and we need to figure out the reason and then add it.
-
### 🐛 Describe the bug
(I'll add actual benchmarking details and logs and output_code.py in a bit)
I'm doing min_sum and mul_sum in two setups:
1. (D, ) x (D, ) -> scalar
2. (B, N, 1, D) x (B,…
-
./scripts/v1_5/eval/video_chatgpt/run_benchmark_1_correctness.sh
==> output:
python3: can't open file '/data1/trinh/code/ViLa/image_text/VILA_0820/llava/eval/video/run_inference_benchmark_general.p…
-
⚠️⚠️**NOTE**⚠️⚠️ **This list is outdated**, please refer to the following one instead:
https://github.com/godotengine/godot-benchmarks/issues/36
⚠️⚠️**END OF NOTE**⚠️⚠️
This is a list of benc…
-
Tracks the open issues for Falcon40b prefill to hit target perf.
Last updated: May 27th
# Prefill
*bfp8*
- Measured May 22nd, main
- 1 GHz
- Perf measurements based on 1 layer perf and ext…
-
![image](https://user-images.githubusercontent.com/78162914/187905422-720f1c81-ec46-4a99-91e3-4592a25f400a.png)
When I trained YOLOV3 at Batch Size = 4, ‘nan’ appeared. After Debug, I found the **inf…
-
How many operations are supported under the framework of PocketFlow? I didn't find any docs listing the ops available.
By the way, the acceleration ratio according to the performance of mobilenet V1…
-
The [RNN-T CmdGen](https://github.com/ctuning/ck-mlperf/tree/master/cmdgen/benchmark.speech-recognition-loadgen/.cm) is work-in-progress. We started it for the v0.7 submission round, but eventually di…
-
When I run the `run_eval.sh` in the `mask_rcnn_2go`, the error message shows:
```
Traceback (most recent call last):
File "code/eval_seg_cpu.py", line 193, in
main()
File "code/eval_seg_…