In the paper (page 5), Fm is said to be split into three branches, with a matrix multiplication performed between two of them to create the channel attention map A, but I can't see that in the code. Is that a new change, or is it a more effective way of doing it?
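For reference, here is roughly how I read that description, as a minimal pure-Python sketch rather than the authors' actual method. It assumes Fm has already been flattened to a C x N matrix (channels by spatial positions). The row-wise softmax and the use of the third branch to re-weight the features are my own assumptions, and all function names are hypothetical.

```python
import math


def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(m):
    """Swap rows and columns of a list-of-rows matrix."""
    return [list(col) for col in zip(*m)]


def softmax_rows(m):
    """Normalize each row so that its entries sum to 1."""
    out = []
    for row in m:
        peak = max(row)  # subtract the row maximum for numerical stability
        exps = [math.exp(v - peak) for v in row]
        total = sum(exps)
        out.append([e / total for e in exps])
    return out


def channel_attention(fm):
    """fm is a C x N matrix: C channels, N flattened spatial positions."""
    # Three branches taken from the same feature map.
    branch_a = fm             # C x N
    branch_b = transpose(fm)  # N x C
    branch_c = fm             # C x N

    # Matrix multiplication between two branches gives the C x C map A.
    # The softmax is an assumption; the description only mentions the product.
    attention = softmax_rows(matmul(branch_a, branch_b))

    # Assumed use of the third branch: re-weight the channels with A.
    refined = matmul(attention, branch_c)  # C x N
    return attention, refined


fm = [[1.0, 0.0, 2.0],
      [0.0, 1.0, 1.0]]
A, out = channel_attention(fm)
```

With this toy input, `A` is 2 x 2 (one row per channel) and `out` has the same 2 x 3 shape as `fm`. If the code does something equivalent in a different form, that would answer my question.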