-
**What is your question?**
Hello!
I’ve been exploring the Cutlass examples for GEMM and Convolution and noticed the use of double buffering.
https://developer.nvidia.com/blog/cutlass-linear-algebra-…
-
Steps to reproduce:
1. Login in the Viewer.
2. Click "Communicate"→"Gestures".
3. Try quickly and simultaneously pressing different types of gestures.
Actual behavior:
Crash the viewer after pressi…
-
I would like to see memory bandwidth metrics from Cadvisor. There are specific metrics named "container_memory_bandwidth_bytes".
I know they are under resctrl. However, I can not enable the cadvis…
-
I used a ARM machine to test the end-to-end output, but the performance does not match the results mentioned in the paper. The tested data of llama.cpp and T-MAC is nearly same. I've posted the measur…
-
-
### Environment
Second Life Release 7.1.11.11565212741 (64bit)
Release Notes
CPU: Intel(R) Core(TM) i7-5930K CPU @ 3.50GHz (3491.92 MHz)
Memory: 32610 MB
OS Version: Microsoft Windows 10 64-bit (Bui…
-
Environment
Both DeltaFPS (7.1.10.10800445603) and ExtraFPS (7.1.11.11565212741)
You are at 121.0, 110.0, 23.2 in Pasta Cake located at simhost-054bc31cb9bd66326.agni
SLURL: http://maps.secondlife.…
-
I haven't looked through the code, but I copied the bandwidth and OpenSBLI performance numbers into LibreOffice, made a scatter plot out of it, and the R^2 is coming out as 0.49 (=> R = 0.7), which is…
-
CUDA programming , which is essential for ML/AI optimization, is incredibly sought in the ML industry especially as we entered the LLM era. In order to make the neural network training faster and more…
-
I am trying to profile my application on AMD Ryzen Threadripper 3960X (Zen2 arch). Specifically, I am trying to measure memory bandwidth and I get all 0's in the output. The OS I am using is Ubuntu 18…