lancopku / Prime

A simple module consistently outperforms self-attention and Transformer model on main NMT datasets with SoTA performance.
Other
87 stars 9 forks source link