Open LDY1998 opened 1 year ago
Hi, when will the CPU version be available? Looking forward to it.
Hi @SpringWave1, you can try it even now.
a = hidet.randn([3, 4])
b = hidet.randn([4, 3])
c = a @ b
Hidet will generate the matrix multiplication kernel and run it with CPU. However, now our auto-scheduler only generates kernel that uses a single thread, and there is not efficient matmul schedule template. We are still working on these. The good support might come in the end of this year.
This is a tracking issue for adding CPU compilation without CUDA:
176
libc
for memory management instead of CUDA Host189