Closed iory closed 5 years ago
This PR modifies the arguments of the distributed AdamW optimizer. `local_rank` and `world_size` now default to `None`; when left unset, they are filled in automatically from `dist.get_rank()` and `dist.get_world_size()`. I also added test code for `DistributedAdamW` and `DistributedSGD`.
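As a rough illustration of the default handling described above, here is a minimal sketch. The helper name `resolve_rank_and_world_size` and its `dist` parameter are hypothetical and not taken from this PR; the only real API calls assumed are `torch.distributed.get_rank()` and `torch.distributed.get_world_size()`, which require an initialized process group.

```python
def resolve_rank_and_world_size(local_rank=None, world_size=None, dist=None):
    """Hypothetical helper: fill in unset values from the distributed backend.

    `dist` is an assumption added for illustration; it lets a stand-in module
    be passed in, and falls back to torch.distributed when omitted.
    """
    if local_rank is None or world_size is None:
        if dist is None:
            # Imported lazily so explicit values work without a process group.
            import torch.distributed as dist
        if local_rank is None:
            # Compare against None, not truthiness: rank 0 is a valid value.
            local_rank = dist.get_rank()
        if world_size is None:
            world_size = dist.get_world_size()
    return local_rank, world_size
```

With both values passed explicitly, the helper returns them unchanged and never touches the backend, so existing callers that supply `local_rank` and `world_size` would behave as before.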
Thanks!