Status: Closed (closed by kunal-vaishnavi 1 month ago)
Description
This PR adds support for packing the bias after a packed QKV MatMul.
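To illustrate the idea, here is a minimal pure-Python sketch of what packing the bias alongside a packed QKV MatMul could look like. It is not the code from this PR: the helper names (`matvec`, `pack_columns`, `pack_bias`, `packed_qkv`) are hypothetical, and the column order `[Q | K | V]` for the packed weight is an assumption. The point is only that concatenating the three biases in the same order as the packed weight columns gives the same result as three separate projections.

```python
def matvec(x, w):
    """Multiply a row vector x (length n) by a matrix w (n rows, m columns)."""
    return [sum(x[i] * w[i][j] for i in range(len(x))) for j in range(len(w[0]))]


def pack_columns(wq, wk, wv):
    """Concatenate the Q, K, V weights column-wise as [Q | K | V] (assumed layout)."""
    return [rq + rk + rv for rq, rk, rv in zip(wq, wk, wv)]


def pack_bias(bq, bk, bv):
    """Concatenate the Q, K, V biases in the same order as the packed weight columns."""
    return bq + bk + bv


def packed_qkv(x, w_packed, b_packed):
    """One MatMul over the packed weight, followed by one Add of the packed bias."""
    y = matvec(x, w_packed)
    return [a + b for a, b in zip(y, b_packed)]


# Toy example with hidden size 2 (values chosen arbitrarily for illustration).
x = [1.0, 2.0]
wq, bq = [[1.0, 0.0], [0.0, 1.0]], [0.5, 0.5]
wk, bk = [[2.0, 0.0], [0.0, 2.0]], [1.0, 1.0]
wv, bv = [[1.0, 1.0], [1.0, 1.0]], [-1.0, -1.0]

# Reference: three separate projections, each with its own bias.
q = [a + b for a, b in zip(matvec(x, wq), bq)]
k = [a + b for a, b in zip(matvec(x, wk), bk)]
v = [a + b for a, b in zip(matvec(x, wv), bv)]

# Packed: a single MatMul and a single bias Add.
packed = packed_qkv(x, pack_columns(wq, wk, wv), pack_bias(bq, bk, bv))
```

In this sketch `packed` equals `q + k + v` (list concatenation), so the packed output can be split back into Q, K, and V by slicing at multiples of the hidden size.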
Motivation and Context
This PR is a follow-up to this PR.