Closed jackwilkie closed 1 year ago
intersample attention is now wrapper for nn.MultiheadAttention to leverage flash attention and efficency improvements
intersample attention is now wrapper for nn.MultiheadAttention to leverage flash attention and efficency improvements