THUNLP-MT / MT-Reading-List

A machine translation reading list maintained by Tsinghua Natural Language Processing Group
BSD 3-Clause "New" or "Revised" License
2.42k stars 448 forks source link
machine-translation reading-list

Machine Translation Reading List

This is a machine translation reading list maintained by the Tsinghua Natural Language Processing Group.

The past three decades have witnessed the rapid development of machine translation, especially for data-driven approaches such as statistical machine translation (SMT) and neural machine translation (NMT). Due to the dominance of NMT at the present time, priority is given to collecting important, up-to-date NMT papers; the Edinburgh/JHU MT research survey wiki has good coverage of older papers and a brief description for each sub-topic of MT. Our list is still incomplete and the categorization might be inappropriate. We will keep adding papers and improving the list. Any suggestions are welcome!

10 Must Reads

Tutorials and Surveys

Statistical Machine Translation

Word-based Models

Phrase-based Models

Syntax-based Models

Discriminative Training

System Combination

Human-centered SMT

Interactive SMT

Adaptation

Evaluation

Neural Machine Translation

Model Architecture

Attention Mechanism

Open Vocabulary

Training Objectives and Frameworks

Decoding

Low-resource Language Translation

Semi-supervised Learning

Unsupervised Learning

Pivot-based Methods

Data Augmentation Methods

Data Selection Methods

Transfer Learning

Meta Learning

Multilingual Machine Translation

Prior Knowledge Integration

Word/Phrase Constraints

Syntactic/Semantic Constraints

Coverage Constraints

Document-level Translation

Robustness

Interpretability

Linguistic Interpretation

Fairness and Diversity

Efficiency

Pre-Training

Non-Autoregressive Translation

Speech Translation and Simultaneous Translation

Multi-modality

Ensemble and Reranking

Domain Adaptation

Quality Estimation

Human-centered NMT

Interactive NMT

Automatic Post-Editing

Poetry Translation

Eco-friendly

Compositional Generalization

Endangered Language Revitalization

Word Translation

WMT Winners

WMT is the most important annual international competition on machine translation. We collect the competition results on the news translation task since WMT 2016 (the First Conference of Machine Translation) and summarize the techniques used in the systems with the top performance. Currently, we focus on four directions: ZH-EN, EN-ZH, DE-EN, and EN-DE. The summarized algorithms might be incomplete; your suggestions are welcome!

WMT 2019

WMT 2018

WMT 2017

WMT 2016