Open flrngel opened 6 years ago
https://arxiv.org/abs/1610.10099 aka ByteNet paper from Deepmind
Notations
Features
(Desiderata is latin word of disideratum, which means model's goal in this paper)
https://arxiv.org/abs/1610.10099 aka ByteNet paper from Deepmind
Notations
Abstract
Features
1. Introduction
2. Neural Translation Model
2.1. Desiderata
(Desiderata is latin word of disideratum, which means model's goal in this paper)
3. ByteNet
3.1. Encoder-Decoder Stacking
3.2. Dynamic Unfolding
3.4. Masked One-dimensional Convolutions
3.5. Dilation
4. Model Comparison
Todo