Closed tthakkal closed 3 months ago
Enabled fused_sdpa flash attention for starcoder2 model
Fixes # (issue)
Anyone in the community is free to review the PR once the tests have passed. Feel free to tag members/contributors who may be interested in your PR.
What does this PR do?
Enabled fused_sdpa flash attention for starcoder2 model
Fixes # (issue)
Before submitting
Who can review?
Anyone in the community is free to review the PR once the tests have passed. Feel free to tag members/contributors who may be interested in your PR.