Closed YanjunChen329 closed 1 year ago
This pull request was exported from Phabricator. Differential Revision: D46951673
This pull request was exported from Phabricator. Differential Revision: D46951673
This pull request has been merged in facebookresearch/d2go@b0abd7aa0024abb7c12c333db8bd3f48db054328.
Summary: Disable FSDP mixed precision for model buffers. Buffers are usually small in size so there's very limited performance gain for enabling mixed precision. Plus, applications like BatchNorm layers and diffusion models are very sensitive to the precision of buffers. Thus, we stick to full precision for buffers in FSDP.
Differential Revision: D46951673