Closed weijiekoh closed 9 months ago
Current code is in https://github.com/td-kwj-zp2023/webgpu-msm
Benchmarks on M1
for reference:
For 2^20
inputs, the points were taking too long to load on public wifi.
I'll start a new PR after rebasing the code in https://github.com/td-kwj-zp2023/webgpu-msm to the new structure from #110.
Done:
Todo:
cuzk_gpu()
(contains many, many smaller tasks)