wangqiangneu / MT-PaperReading

Record my paper reading about Machine Translation and other related works.
36 stars 2 forks source link

19-EMNLP-Mask-Predict: Parallel Decoding of Conditional Masked Language Models #21

Open wangqiangneu opened 4 years ago

wangqiangneu commented 4 years ago

简介

non-autogression decoding. 用iterative的方式不断refine已有的翻译结果。第一轮是完全NAT,然后再次基础上,选择confidence差的N个词,mask掉,去对mask的内容进行refine,迭代X轮结束。对mask的恢复过程类似BERT。整篇文章读起来很舒服,听起来也合理。后面实验部分也很饱满。

论文信息

总结

yokusama commented 4 years ago

为什么distillation必要 这篇感觉做的挺好的: Understanding Knowledge Distillation in Non-autoregressive Machine Translation

wangqiangneu commented 4 years ago

Understanding Knowledge Distillation in Non-autoregressive Machine Translation

多谢~ 学习学习~