flrngel / understanding-ai

personal repository
36 stars 6 forks source link

Neural Machine Translation in Linear Time #11

Open flrngel opened 6 years ago

flrngel commented 6 years ago

https://arxiv.org/abs/1610.10099 aka ByteNet paper from Deepmind

Notations

Abstract

Features

1. Introduction

2. Neural Translation Model

image

2.1. Desiderata

(Desiderata is latin word of disideratum, which means model's goal in this paper)

3. ByteNet

3.1. Encoder-Decoder Stacking

3.2. Dynamic Unfolding

3.4. Masked One-dimensional Convolutions

3.5. Dilation

4. Model Comparison

image image

Todo