Open Jokeren opened 4 years ago
I just attended your presentation. Is there anywhere I can find the Winograd code described in your paper? Much appreciated.
Thank you for listening!
We plan to open source the Winograd implementation later.
Before that, we will add more examples including saxpy and GEMM.
Hi! Any update on the saxpy and GEMM examples? Turingas looks super useful, thank you!
Hi, I have read your paper and am very interested in the program. Is there any update on the CUDA implementation of the Winograd algorithm? I've also read the code in Xuqiantong's CUDA-Winograd repository (https://github.com/daadaada/CUDA-Winograd) and think its matrix multiplication implementation needs more optimization. I'd like to know how you implemented this algorithm. Thanks!
Same question: where can I find the code described in the paper?
Hey,
Is it possible to see the example where you reorder the SASS instructions for better latency hiding? Many thanks.
Best,