pentium3 / sys_reading

system paper reading notes
235 stars 12 forks source link

PyTorch FSDP: Experiences on Scaling Fully Sharded Data Parallel #342

Open pentium3 opened 8 months ago

pentium3 commented 8 months ago

https://arxiv.org/pdf/2304.11277.pdf