Open FrancescoSaverioZuppichini opened 1 year ago
I want to add/experiment with Token Merging to create different models sizes. However, is not possible to easy add it if you use nn.MultiHeadAttention but some ideas are presented in this issue
nn.MultiHeadAttention
I want to add/experiment with Token Merging to create different models sizes. However, is not possible to easy add it if you use
nn.MultiHeadAttention
but some ideas are presented in this issue