A masked language modeling objective to train a model to predict any subset of the target words, conditioned on both the input text and a partially masked target translation.
Other
240
stars
38
forks
source link
Why is the BLEU obtained from the training model provided much higher than the value on paper? #11
I get 34.42 on WMT14 DE->EN, 35.20 on WMT16 EN->RO, 35.62 on WMT RO->EN. These values are much higher than that in origin paper. This is strange, and what happened?
I download the provided trained model, and test on test dataset, but get much higher BLEU than the values in paper.
I use the scripts provided, and don't change anything:
I get 34.42 on WMT14 DE->EN, 35.20 on WMT16 EN->RO, 35.62 on WMT RO->EN. These values are much higher than that in origin paper. This is strange, and what happened?