TL;DR
A survey paper on improved variants of the Transformer model, which has spread rapidly in recent years, especially in natural language processing. The variants are organized by criteria such as memory and attention usage patterns. The figures and discussion make the overall flow easy to follow.
Why it matters:
Paper URL
https://arxiv.org/abs/2009.06732
Submission date (yyyy/mm/dd)
Authors and institutions
Methods
Results
Comments