This repository contains a reading list of Software Engineering papers and articles!

Paper Review: Attention Is All You Need #42

Open parvezmrobin opened 2 years ago

parvezmrobin commented 2 years ago


Advances in Neural Information Processing Systems 30 (NIPS 2017)

Link to The Paper


Name of The Authors

Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Łukasz Kaiser, Illia Polosukhin

Year of Publication

2017



This paper presents the Transformer, the first sequence-to-sequence model based entirely on attention, with no recurrence or convolution. It sets new state-of-the-art BLEU scores on the WMT English-to-German (by 2 BLEU) and English-to-French (by 0.7 BLEU) translation tasks. Because it removes the sequential dependencies of earlier architectures (an RNN needs a number of sequential steps proportional to the sequence length, and a CNN needs a stack of layers to connect distant positions), it parallelizes well and can be trained much faster.
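To make the parallelization point concrete, here is a minimal sketch of the scaled dot-product attention at the core of the model, softmax(QK^T / sqrt(d_k))V, written in plain Python. The function names and the toy input are illustrative assumptions, not code from the paper; a real implementation would use batched matrix operations, multiple heads, and learned projections.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def scaled_dot_product_attention(queries, keys, values):
    """Compute softmax(Q K^T / sqrt(d_k)) V for lists of row vectors.

    queries: n x d_k, keys: m x d_k, values: m x d_v.
    Returns (outputs, weights), where outputs is n x d_v and weights is n x m.
    """
    d_k = len(keys[0])
    d_v = len(values[0])
    scale = math.sqrt(d_k)
    outputs, weights = [], []
    # Each query is processed independently: no iteration reads the result
    # of an earlier one, so all positions could be computed in parallel.
    # An RNN, by contrast, needs the previous hidden state at every step.
    for q in queries:
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / scale for k in keys]
        w = softmax(scores)
        out = [sum(wj * v[c] for wj, v in zip(w, values)) for c in range(d_v)]
        weights.append(w)
        outputs.append(out)
    return outputs, weights


# Toy self-attention over three positions (values chosen arbitrarily).
x = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
outputs, weights = scaled_dot_product_attention(x, x, x)
```

Every position attends to every other position in a single step, so the path between two distant tokens has constant length. The loop over queries is only there for readability; since its iterations are independent, it collapses into one matrix product on parallel hardware.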

Contributions of The Paper


It would be disrespectful to comment on this paper.