Closed vkuzo closed 3 months ago
@vkuzo has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.
@vkuzo has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.
@vkuzo has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.
This pull request has been merged in pytorch-labs/float8_experimental@ab2b828936412ec40bf45f18335ba62fdef60230.
Stack from ghstack (oldest at bottom):
351
350
349
348
347
345
344
Summary:
Removes delayed scaling from
float8_tensor.py
. After this PR, the invariant is that everything infloat8_tensor.py
requires the scale to be calculated elsewhere. This moves the codebase towards separation of concerns for calculating the scale (via various scaling strategies), separated from creating an instance ofFloat8Tensor
.Note that stateful delayed scaling is the reason we need this separation.
Test Plan:
Reviewers:
Subscribers:
Tasks:
Tags:
Differential Revision: D60291447