Kyubyong / transformer

A TensorFlow Implementation of the Transformer: Attention Is All You Need
Apache License 2.0
4.28k stars 1.3k forks source link

use zero_padding_mask and bias to replace non-bias linear projection #133

Open Traeyee opened 5 years ago