Open wushilian opened 6 years ago
hello,do you understand the method that full connection layers is used in getting initial variables?? @wushilian
and there still exists a question ,how to understand the way that use MLP to get the weights of attention?? @wushilian @DeepRNN
how about using the mean of conv feature directly?? @DeepRNN @wushilian
Which Attention did the author used in his code? Stochastic “Hard” Attention or Deterministic “Soft” Attention? @wushilian @kstys @DeepRNN @whguan
I think it's the soft attention @jsd1994wxj
In the code,you use fully_connect layer to calculate attention,why don't use formula in <>?