Closed vkuzo closed 2 weeks ago
@vkuzo has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.
This pull request has been merged in pytorch-labs/float8_experimental@3ec96650001126283002cc83595fdbf9c605090d.
Stack from ghstack (oldest at bottom):
300
299
298
297
296
293
291
290
Summary:
Makes the DTensor TP/SP tests also test
Float8Linear
with all scaling types configured to be dynamic.We can add support for delayed scaling with float8 all-gather for
x
anddL_dY
in a future PR, as needed.Test Plan:
Reviewers:
Subscribers:
Tasks:
Tags:
Differential Revision: D59305797