pytorch / torchtitan

A native PyTorch Library for large model training
BSD 3-Clause "New" or "Revised" License

[BE] remove old pytorch version warning on strided sharding since 2.5 is officially released #665

Closed XilunWu closed 3 weeks ago

XilunWu commented 3 weeks ago

Stack from ghstack (oldest at bottom):

#507 added a PyTorch version check that runs when users enable FSDP+TP. It verifies that the installed PyTorch version includes DTensor strided sharding, which is required for correct DTensor checkpointing. Since PyTorch 2.5 has been officially released and includes strided sharding, we can safely remove this warning.
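For context, a version gate of the kind being removed might look roughly like the sketch below. This is not the torchtitan code: the names `MIN_VERSION`, `parse_major_minor`, and `warn_if_no_strided_sharding` are hypothetical, and the check only compares the major and minor release numbers, so it cannot tell an early 2.5 nightly from the final release.

```python
import re
import warnings

# Assumption for illustration: strided sharding is available from PyTorch 2.5,
# as stated in the PR description.
MIN_VERSION = (2, 5)


def parse_major_minor(version: str) -> tuple[int, int]:
    """Extract (major, minor) from a version string such as '2.4.1+cu121'."""
    match = re.match(r"(\d+)\.(\d+)", version)
    if match is None:
        raise ValueError(f"unrecognized version string: {version!r}")
    return int(match.group(1)), int(match.group(2))


def warn_if_no_strided_sharding(version: str) -> bool:
    """Warn and return True if `version` is older than MIN_VERSION."""
    # Compare integer tuples, not strings, so that 2.10 sorts after 2.5.
    if parse_major_minor(version) < MIN_VERSION:
        warnings.warn(
            f"PyTorch {version} may lack DTensor strided sharding; "
            "FSDP+TP checkpoints could be saved incorrectly.",
            stacklevel=2,
        )
        return True
    return False
```

A caller would pass `torch.__version__` when FSDP and TP are both enabled. Once 2.5 is the minimum supported release, a gate like this never fires, which is why the PR removes it.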