kyegomez / BitNet

Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in PyTorch
https://discord.gg/qUtxnK2NMf
MIT License

[BUG] Multi-head attention is a no-op for BitLinear #24

Closed Bsdnbo closed 7 months ago

Bsdnbo commented 7 months ago

Describe the bug
A clear and concise description of what the bug is and what the main root cause error is. Test very thoroughly before submitting.

To Reproduce
Steps to reproduce the behavior:

  1. Go to '...'
  2. Click on '....'
  3. Scroll down to '....'
  4. See error

Expected behavior
A clear and concise description of what you expected to happen.

Screenshots
If applicable, add screenshots to help explain your problem.

Additional context
Add any other context about the problem here.


kyegomez commented 7 months ago

@Bsdnbo bit attention has been created, is this solved?