facebookresearch / fairscale

PyTorch extensions for high performance and large scale training.
Other
3.15k stars 275 forks source link

Should assign norm_type instead of scale_grad_by_freq #1162

Closed brad-mengchi closed 7 months ago

brad-mengchi commented 7 months ago

What does this PR do?

Fixes #1156 (issue).

Before submitting

PR review

Anyone in the community is free to review the PR once the tests have passed. If we didn't discuss your PR in Github issues there's a high chance it will not be merged.

brad-mengchi commented 7 months ago

Already fixed by https://github.com/facebookresearch/fairscale/pull/1157. so close this.