Closed mannatsingh closed 3 years ago
Summary: This halves the layer's memory consumption and speeds up training.
Differential Revision: D29577476
This pull request was exported from Phabricator. Differential Revision: D29577476