NVIDIA / Fuser

A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")

Ring-based decomposition for Allgather+GEMM overlap ATen implementation #3392

Open nsarka opened 1 week ago

nsarka commented 1 week ago

ATen-based implementation of the ring-based Allgather+GEMM overlap described in https://docs.google.com/document/d/1Fzr9Zs2Dqfj3e4yR8LKxFrRqC1EkMUfQczJMYQGJQUI/edit?tab=t.0#heading=h.5x7hptdjzhet
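
For readers unfamiliar with the technique named in the title, here is a minimal single-process sketch of the general idea, not the code in this PR. It assumes A is sharded by rows across the ranks and B is replicated on every rank; the linked design may choose a different layout. The names `matmul` and `ring_allgather_matmul` are hypothetical, and the ring "send" is simulated by rotating a Python list, so nothing here actually overlaps communication with compute.

```python
def matmul(a, b):
    """Plain-Python product of two row-major nested lists."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def ring_allgather_matmul(a_shards, b):
    """Simulate C = allgather(A) @ B on a ring of len(a_shards) ranks.

    a_shards[r] is the row block of A initially owned by rank r (assumption:
    row sharding); b is replicated. Returns one full copy of C per rank.
    """
    world = len(a_shards)
    held = list(a_shards)  # held[r]: shard currently sitting in rank r's buffer
    out = [[None] * world for _ in range(world)]  # out[r][j]: row block j of C on rank r
    for step in range(world):
        for rank in range(world):
            # After `step` hops, rank r holds the shard that started on rank r - step.
            src = (rank - step) % world
            out[rank][src] = matmul(held[rank], b)
        if step < world - 1:
            # Ring shift: each rank forwards its buffer to rank + 1. A real
            # implementation would issue this send/recv asynchronously so it
            # overlaps with the GEMM above; here it runs sequentially.
            held = [held[(rank - 1) % world] for rank in range(world)]
    # Concatenate the row blocks in order to form the full C on every rank.
    return [[row for block in out[rank] for row in block] for rank in range(world)]


if __name__ == "__main__":
    shards = [[[1, 2]], [[3, 4]], [[5, 6]]]  # three ranks, one row of A each
    b = [[1, 2], [3, 4]]
    for rank, c in enumerate(ring_allgather_matmul(shards, b)):
        print(rank, c)
```

The point of the decomposition is that each step only needs the shard already in the local buffer, so the GEMM for step `s` can run while the shard for step `s + 1` is in flight, instead of waiting for a full allgather to finish first.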