Added an fp16-matrix value for all VK_KHR_cooperative_matrix capable devices, such as RTX 20 series and newer, and RDNA3.
It reflects the computing power of the Tensor Cores or a similar AI engine on the device.
At the moment, all NVIDIA Turing and newer devices are known to work.
RDNA3 devices work with the latest Windows driver (130+ TFLOPS measured on my 7900 XTX graphics card).
In the future, the Linux Mesa driver will follow up, bringing this extension to Intel and other devices.
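
To make a figure like 130+ TFLOPS easier to interpret, here is a minimal sketch of how a matrix throughput number is commonly derived, assuming the usual convention that one multiply-accumulate counts as two floating-point operations. The function name `matrix_tflops`, the 16x16x16 tile shape, and the counts below are hypothetical and chosen only to illustrate the arithmetic; this is not taken from vkpeak's source.

```python
def matrix_tflops(m, n, k, mma_count, seconds):
    """Throughput in TFLOPS for mma_count multiply-accumulates of an m x n x k tile."""
    # One m x n x k matrix multiply-accumulate performs m*n*k multiplies
    # and m*n*k adds, so it is counted as 2*m*n*k floating-point operations.
    flops = 2 * m * n * k * mma_count
    return flops / seconds / 1e12


# Hypothetical workload: 10 billion 16x16x16 tile operations in one second.
tflops = matrix_tflops(16, 16, 16, mma_count=10_000_000_000, seconds=1.0)
print(f"{tflops:.2f} TFLOPS")  # prints "81.92 TFLOPS"
```

A real benchmark would take `mma_count` from the number of cooperative matrix operations issued across all shader invocations and `seconds` from a GPU timing measurement.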
https://github.com/nihui/vkpeak/releases/tag/20230812
Sample output on an NVIDIA T4