zwhe99 / FeedbackMT

Code of "Improving Machine Translation with Human Feedback: An Exploration of Quality Estimation as a Reward Model"
https://arxiv.org/abs/2401.12873
19 stars 1 forks source link