Open kwonmha opened 2 years ago
I saw `scores = dropout(scores)` on line 331 of `akt.py`. This is the first time I have seen dropout applied to attention weights. Is there a reference or a reason for this?
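
For context, here is a minimal sketch of the pattern I am asking about, written in plain Python rather than copied from `akt.py`. It assumes that `scores` holds the post-softmax attention weights at that point; the helper names (`softmax`, `dropout`, `attend`) and the default `p=0.1` are hypothetical and only for illustration.

```python
import math
import random


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def dropout(xs, p, rng, training=True):
    """Inverted dropout: zero each entry with probability p (p < 1),
    and scale the surviving entries by 1 / (1 - p)."""
    if not training or p == 0.0:
        return list(xs)
    keep = 1.0 - p
    return [x / keep if rng.random() < keep else 0.0 for x in xs]


def attend(query, keys, values, p=0.1, rng=None, training=True):
    """Single-query scaled dot-product attention with dropout on the weights.

    Assumption: dropout is applied after the softmax, which is what
    "dropout applied to attention weight" suggests.
    """
    rng = rng or random.Random(0)
    d = len(query)
    raw = [sum(q * k for q, k in zip(query, key)) / math.sqrt(d) for key in keys]
    scores = softmax(raw)
    scores = dropout(scores, p, rng, training)  # the line in question
    # Weighted sum of the value vectors.
    return [sum(w * v[j] for w, v in zip(scores, values)) for j in range(len(values[0]))]
```

Note that with dropout active, the weights for a query no longer sum to 1: some keys are ignored entirely for that step and the rest are scaled up.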