Issue: When distributed strategies beyond FSDP (e.g. DDP, single-GPU) were introduced, the default strategy was changed from FSDP to single-GPU. This change is backwards-incompatible: existing configs and runs that relied on the FSDP default are broken. In general, when we add new settings, their defaults should preserve existing behavior.
Fix: Set default distributed_strategy to DistributedStrategy.fsdp
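
A minimal sketch of the intended behavior, not the actual code in this repo. Only distributed_strategy and DistributedStrategy.fsdp come from the description above; the other enum members, the DistributedConfig class, and the load_config helper are hypothetical names used for illustration.

```python
from dataclasses import dataclass
from enum import Enum


class DistributedStrategy(str, Enum):
    # Only `fsdp` is named in the fix; the other member names are assumed.
    fsdp = "fsdp"
    ddp = "ddp"
    single_gpu = "single_gpu"


@dataclass
class DistributedConfig:  # hypothetical config container
    # Default to FSDP so configs written before this setting existed
    # keep the behavior they had.
    distributed_strategy: DistributedStrategy = DistributedStrategy.fsdp


def load_config(raw: dict) -> DistributedConfig:
    """Build a config from a raw dict; a missing key falls back to the default."""
    if "distributed_strategy" in raw:
        return DistributedConfig(DistributedStrategy(raw["distributed_strategy"]))
    return DistributedConfig()
```

With this shape, an older config that never mentions distributed_strategy still resolves to FSDP, while newer configs can opt into another strategy explicitly.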