issues
search
ysh329
/
OpenCL-101
Learn OpenCL step by step.
131
stars
29
forks
source link
cutlass: Efficient GEMM in CUDA
#43
Open
ysh329
opened
3 years ago
ysh329
commented
3 years ago
repo:
https://github.com/NVIDIA/cutlass/blob/master/media/docs/efficient_gemm.md
slide:
https://on-demand.gputechconf.com/gtc/2018/presentation/s8854-cutlass-software-primitives-for-dense-linear-algebra-at-all-levels-and-scales-within-cuda.pdf
blog:
https://developer.nvidia.com/blog/cutlass-linear-algebra-cuda/