flrngel / understanding-ai

personal repository

Dual Learning for Machine Translation #13

Open flrngel opened 6 years ago

https://arxiv.org/abs/1611.00179 paper from USTC, PKU, Microsoft Research (NIPS 2016)

Summary

Model

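  1. Prepare two language models, LM_A and LM_B, each trained on one language (en, fr; WMT14); each outputs the log probability of a sentence
  2. Two translation models are needed, one per direction: P(•|s;Θ_AB) and P(•|s;Θ_BA)
  3. Translate a monolingual sentence with one model, score the output with the target-side LM, translate it back with the other model, and use the LM score plus the reconstruction log probability as the reward for policy gradient training
  4. Swap the two directions and repeat until the models converge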
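
The loop above can be sketched in a toy setting. This is only an illustration under strong assumptions, not the paper's implementation: a "sentence" is a single token id, each translation model is a table of logits, one translation is sampled instead of using beam search, and the names `dual_step`, `theta_ab`, `theta_ba`, `lm_b`, `alpha`, and `lr` are hypothetical. The reward mixes the LM score with the reconstruction log probability, weighted by `alpha`.

```python
import math
import random


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def dual_step(s, theta_ab, theta_ba, lm_b, alpha=0.5, lr=0.1, rng=random):
    """One A -> B -> A round for a single-token 'sentence' s (toy assumption)."""
    # Translate: sample s_mid ~ P(.|s; theta_ab).
    p_ab = softmax(theta_ab[s])
    s_mid = rng.choices(range(len(p_ab)), weights=p_ab)[0]

    # Reward 1: log probability of the translation under the target-side LM.
    r_lm = lm_b[s_mid]
    # Reward 2: log probability of reconstructing s from s_mid.
    p_ba = softmax(theta_ba[s_mid])
    r_rec = math.log(p_ba[s])
    reward = alpha * r_lm + (1 - alpha) * r_rec

    # Policy gradient (REINFORCE, no baseline) for theta_ab:
    # reward * d/d(logits) log P(s_mid|s) = reward * (onehot - probs).
    for j in range(len(p_ab)):
        grad = (1.0 if j == s_mid else 0.0) - p_ab[j]
        theta_ab[s][j] += lr * reward * grad

    # theta_ba: gradient ascent on (1 - alpha) * log P(s|s_mid).
    for j in range(len(p_ba)):
        grad = (1.0 if j == s else 0.0) - p_ba[j]
        theta_ba[s_mid][j] += lr * (1 - alpha) * grad

    return s_mid, reward
```

To cover step 4, the same function can be called with the arguments swapped, e.g. `dual_step(s_b, theta_ba, theta_ab, lm_a)`, alternating between the two directions until the rewards stop improving.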

Abstract

1. Introduction

2. Background: Neural Machine Translation

3. Dual Learning for Neural Machine Translation

5. Discussions