Closed YukeWang96 closed 3 years ago
Page 44 of https://www.nvidia.com/content/PDF/nvidia-ampere-ga-102-gpu-architecture-whitepaper-v2.pdf
As to running tensor cores on 3090, see this https://discuss.tvm.apache.org/t/rfc-byoc-nvidia-cutlass-integration/9147/24?u=hwu36
Thanks a lot!
Hi,
is there any reference for the peak performance of INT1, INT4, INT8, INT16, INT32 for RTX3090 on Tensorcore? Just want to compare my current CUTLASS GEMM versus the theoretical peak performance.
Thanks!