Open web3-panpan opened 5 years ago
attention_new = torch.max(attention, dim=-1, keepdim=True)[0].expand_as(attention) - attention I don't understand
same
attention_new = torch.max(attention, dim=-1, keepdim=True)[0].expand_as(attention) - attention I don't understand