Closed a-r-r-o-w closed 4 days ago
The scripts have supported DeepSpeed from the beginning, but there was a bug in the lr scheduler part, which I believe has now been fixed. To enable DeepSpeed, we just need to configure accelerate correctly (I'll push the configs for this in a follow-up PR).
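Until those configs land, here is a rough sketch of what an accelerate config for DeepSpeed could look like. It is not the config from the follow-up PR: the ZeRO stage, process count, mixed-precision setting, and the file name `deepspeed_zero2.json` are all placeholder assumptions, and the helper names are made up for illustration. The keys follow the layout that `accelerate config` normally writes out.

```python
import json
from pathlib import Path


def build_deepspeed_config(num_processes=2, zero_stage=2):
    """Return an accelerate-style config dict that selects DeepSpeed.

    All values are illustrative assumptions, not the settings from the
    follow-up PR; adjust them for your hardware.
    """
    return {
        "compute_environment": "LOCAL_MACHINE",
        "distributed_type": "DEEPSPEED",
        "deepspeed_config": {
            "zero_stage": zero_stage,
            "gradient_accumulation_steps": 1,
            "offload_optimizer_device": "none",
            "offload_param_device": "none",
            "zero3_init_flag": False,
        },
        "mixed_precision": "bf16",
        "num_machines": 1,
        "num_processes": num_processes,
    }


def write_config(path, config):
    """Write the config as JSON and return the path it was written to."""
    path = Path(path)
    path.write_text(json.dumps(config, indent=2))
    return path


if __name__ == "__main__":
    out = write_config("deepspeed_zero2.json", build_deepspeed_config())
    # The training script name is left as a placeholder on purpose.
    print(f"accelerate launch --config_file {out} <training script>")
```

The resulting file is meant to be passed to `accelerate launch` via `--config_file`; running `accelerate config` interactively is the more usual way to produce an equivalent file.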
Based on this comment