Closed Hannibal046 closed 2 years ago
Hi, thanks for your interest in our paper! We implement their method with a stronger baseline (attention heads=4, dropout=0.3), while they conducted experiments with Transformer-base, so our BLEU scores are higher than theirs.
Hi, thanks for the great work. I am wondering why there is an obvious gap between the BLEU score in your paper and that in
UVR-NMT
, since you all conduct experiments on Multi30k dataset. Do I miss something here ?Your paper
UVR-NMT paper