Open MarcusLoppe opened 4 months ago
I've had great luck using it in the x-transformers decoder layer and I think it would be a great addition to linear attention.
Let me know if I can help!
I've had great luck using it in the x-transformers decoder layer and I think it would be a great addition to linear attention.
Let me know if I can help!