issues
search
Kyubyong
/
transformer
A TensorFlow Implementation of the Transformer: Attention Is All You Need
Apache License 2.0
4.28k
stars
1.3k
forks
source link
why use dropout in function scaled_dot_product_attention?
#154
Open
asmartsnail
opened
4 years ago