pytorch / torchtitan

A native PyTorch Library for large model training
BSD 3-Clause "New" or "Revised" License
1.28k stars 115 forks source link

Rmsnorm cuda #349

Closed lessw2020 closed 1 month ago

lessw2020 commented 1 month ago

cuda RMSNorm working