123456789asdfjkl closed this issue 1 year ago
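Thanks for your interest. After transforming t into t' with a scale-and-shift operation, we get the pair (f, t'), each of shape (seq_len, hidden_dim). g can then be computed with code like the following, where `t_prime` stands for t' (a name containing `'` is not a valid Python identifier): `g = torch.softmax(torch.sum(torch.matmul(f, t_prime.T), dim=1), dim=0).unsqueeze(1)`. The resulting g has shape (seq_len, 1).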
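For anyone who wants to check the shapes, here is a minimal runnable sketch of the one-liner above, split into steps. The function name `compute_gate` and the scalar `scale` and `shift` parameters are assumptions made for illustration only. The exact form of the scale-and-shift operation is not given in this thread, so substitute the real one.

```python
import torch


def compute_gate(f: torch.Tensor, t: torch.Tensor,
                 scale: float = 1.0, shift: float = 0.0) -> torch.Tensor:
    """Sketch of the gate g for f and t of shape (seq_len, hidden_dim)."""
    # ASSUMPTION: a plain scalar scale-and-shift stands in for the real
    # transformation of t into t', which this thread does not specify.
    t_prime = scale * t + shift              # (seq_len, hidden_dim)

    # Pairwise dot products between rows of f and rows of t'.
    scores = torch.matmul(f, t_prime.T)      # (seq_len, seq_len)

    # Sum each row of the score matrix: one scalar per position of f.
    per_position = torch.sum(scores, dim=1)  # (seq_len,)

    # Normalize over positions so the gate values sum to 1.
    g = torch.softmax(per_position, dim=0)   # (seq_len,)

    # Add a trailing axis so g has shape (seq_len, 1).
    return g.unsqueeze(1)


torch.manual_seed(0)
f = torch.randn(4, 8)
t = torch.randn(4, 8)
g = compute_gate(f, t)
print(g.shape)  # torch.Size([4, 1])
```

Since g has shape (seq_len, 1), it would broadcast against a (seq_len, hidden_dim) tensor if used as a multiplicative gate. How g is applied afterwards is not stated in this thread.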
I see. Thanks for your answer!
@123456789asdfjkl Hello, have you managed to reproduce it successfully?
Hi! Thanks for your great work! Could you explain how the gated mechanism is implemented?