robertostling / hnmt

Helsinki Neural Machine Translation system
GNU General Public License v3.0
29 stars 16 forks source link

Coverage vector #3

Open Waino opened 7 years ago

Waino commented 7 years ago

Implement coverage in the attention mechanism, following [1].

[1] Tu, Zhaopeng, et al. "Coverage-based Neural Machine Translation." arXiv preprint arXiv:1601.04811 (2016). http://arxiv.org/pdf/1601.04811

robertostling commented 7 years ago

Perhaps a better (= quicker to implement) start would be to implement the decoding-time coverage normalization method from section 7 of the Google NMT paper. This would only require changing the HNMT code so that it returns attention predictions, and then modifying the beam search code in BNAS to use it.