Open sukjunhwang opened 1 month ago
Previous code transposes the tensor first, making it (B, L, E)->(B, E, L), then reshapes afterwards, leading to an incorrect tensor arrangement.
Fixes # (issue)
Anyone in the community is free to review the PR once the tests have passed. Feel free to tag members/contributors who may be interested in your PR.
Thank you! cc @ArthurZucker
Previous code transposes the tensor first, making it (B, L, E)->(B, E, L), then reshapes afterwards, leading to an incorrect tensor arrangement.
What does this PR do?
Fixes # (issue)
Before submitting
Who can review?
Anyone in the community is free to review the PR once the tests have passed. Feel free to tag members/contributors who may be interested in your PR.