apache / tvm

Open deep learning compiler stack for cpu, gpu and specialized accelerators
https://tvm.apache.org/
Apache License 2.0
11.41k stars 3.4k forks source link

[WebGPU] Add `tir.dp4a` #17124

Closed Jiawei-Shao closed 4 days ago

Jiawei-Shao commented 1 week ago

This patch adds tir.dp4a as a new TIR built-in operator as a preparation of supporting int8 computation with dot4I8Packed in WebGPU backend.

issue: https://github.com/apache/tvm/issues/16627

Jiawei-Shao commented 1 week ago

@tqchen PTAL, thanks!