leaderj1001 / Stand-Alone-Self-Attention

Implementing Stand-Alone Self-Attention in Vision Models using Pytorch
MIT License

Question about einsum. #13

Closed veritas9872 closed 1 year ago

veritas9872 commented 4 years ago

Hello. I was going through the attention implementation and have a question about the operation performed by the einsum call. I am not very familiar with the Einstein summation convention, and I suspect many others are in the same position. I would be very grateful if comments explaining the operation were added to the code. As it stands, I find it difficult to tell which parts of the code correspond to which parts of the paper. Many thanks for any help.
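For readers in the same position, here is a minimal sketch of how an einsum subscript string reads. It is not taken from the repository: the subscript string `bnchwk,bnchwk->bnchw`, the tensor shapes, and the names `weights` and `values` are all assumptions chosen for illustration. It uses NumPy, whose `einsum` follows the same subscript notation as `torch.einsum`. The idea is that any index appearing on the left of `->` but not on the right (here `k`, the positions in a local window) is summed over, which would correspond to a weighted sum of values over a neighborhood.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical shapes: batch, groups, channels per group, height, width, window positions
b, n, c, h, w, k = 2, 4, 3, 5, 5, 9

# Illustrative attention weights: a softmax over the k window positions (last axis)
logits = rng.standard_normal((b, n, c, h, w, k))
weights = np.exp(logits - logits.max(axis=-1, keepdims=True))
weights /= weights.sum(axis=-1, keepdims=True)

# Illustrative values gathered from the same k window positions
values = rng.standard_normal((b, n, c, h, w, k))

# Indices b, n, c, h, w appear on both sides of "->" and are kept.
# Index k is absent from the output, so the elementwise product is summed over k.
out_einsum = np.einsum("bnchwk,bnchwk->bnchw", weights, values)

# The same computation written without einsum
out_manual = (weights * values).sum(axis=-1)

print(out_einsum.shape)
print(np.allclose(out_einsum, out_manual))
```

Under these assumptions, the einsum call is just shorthand for "multiply elementwise, then sum over the window axis", so each output pixel is a weighted average of the values in its window.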