Closed ZhiyuLi-goog closed 2 months ago
Before: https://screenshot.googleplex.com/5LhbKL58gBwAGUr After: https://screenshot.googleplex.com/BFziQHuGGDuAjhN
0.5% to 0.8% improvement after borrowing sharding from paxml. +6% improvement in int8 with some better layout in backwards, still WIP in analyzing.
LGTM! Please wait for @RissyRan approval as well
Need code owner's approval. Thank you @gobbleturk.
Before: https://screenshot.googleplex.com/5LhbKL58gBwAGUr After: https://screenshot.googleplex.com/BFziQHuGGDuAjhN
0.5% to 0.8% improvement after borrowing sharding from paxml. +6% improvement in int8 with some better layout in backwards, still WIP in analyzing.