facebookresearch / d2go

D2Go is a toolkit for efficient deep learning
Apache License 2.0

Fix key error 0 in multinode training #579

Closed: frabu6 closed this pull request 1 year ago

frabu6 commented 1 year ago

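Summary: The current code assumed that training runs on a single node, so that global rank 0 is always present on each node. This assumption fails in multi-node training: nodes other than the first do not host global rank 0, so looking up key 0 there results in a key error (`KeyError: 0`).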
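The failure mode described in the summary can be illustrated with a minimal sketch. This is not the code changed in this pull request. It assumes a per-node result dictionary keyed by global rank, and every name in it (`collect_local_results`, `main_result_fragile`, `main_result_robust`) is hypothetical.

```python
from typing import Dict


def collect_local_results(node_rank: int, procs_per_node: int) -> Dict[int, str]:
    """Simulate the results gathered on one node, keyed by global rank.

    Hypothetical helper: node `node_rank` hosts global ranks
    node_rank * procs_per_node .. (node_rank + 1) * procs_per_node - 1.
    """
    first = node_rank * procs_per_node
    return {
        rank: f"result-from-rank-{rank}"
        for rank in range(first, first + procs_per_node)
    }


def main_result_fragile(results: Dict[int, str]) -> str:
    # Assumes global rank 0 lives on this node.
    # True only on the first node; raises KeyError(0) elsewhere.
    return results[0]


def main_result_robust(results: Dict[int, str]) -> str:
    # Uses the lowest global rank that is actually present on this node,
    # which corresponds to the node's local rank 0.
    return results[min(results)]


if __name__ == "__main__":
    node1 = collect_local_results(node_rank=1, procs_per_node=4)
    print(sorted(node1))              # global ranks hosted on the second node
    print(main_result_robust(node1))  # works on any node
    try:
        main_result_fragile(node1)
    except KeyError as err:
        print("KeyError:", err)       # the single-node assumption breaks here
```

On the first node both lookups agree. On any other node only the lookup that does not hard-code key 0 succeeds, which is the general shape of fix the summary points to.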

Reviewed By: crassirostris

Differential Revision: D46841286

facebook-github-bot commented 1 year ago

This pull request was exported from Phabricator. Differential Revision: D46841286

facebook-github-bot commented 1 year ago

This pull request has been merged in facebookresearch/d2go@783288394b9ac27b63cc816f751c2f4d6efe8fdc.