Hi,
I fine-tuned the PyTorch model on WMT from scratch. After 50,000 iterations, I was getting a BLEU score of only 1.
I think the way the data is preprocessed does not match the way decoding works in the Hugging Face model, which results in low performance for the HF model.
It would be great if this could be fixed. Thanks.
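To illustrate the kind of mismatch I suspect, here is a minimal sketch of a round-trip check: encode each reference sentence, decode it again, and list the sentences that do not come back unchanged. The function name `find_round_trip_mismatches` and the toy `toy_encode` / `toy_decode` helpers are hypothetical stand-ins, not code from this repository. In a real check, the project's own preprocessing and the model tokenizer's decoding would be passed in instead.

```python
from typing import Callable, Dict, List, Sequence, Tuple


def find_round_trip_mismatches(
    sentences: Sequence[str],
    encode: Callable[[str], List[int]],
    decode: Callable[[List[int]], str],
) -> List[Tuple[str, str]]:
    """Return (original, round_tripped) pairs where decode(encode(s)) != s."""
    mismatches = []
    for sentence in sentences:
        round_tripped = decode(encode(sentence))
        if round_tripped != sentence:
            mismatches.append((sentence, round_tripped))
    return mismatches


# Hypothetical toy tokenizer, used only to make the sketch runnable.
_vocab: Dict[str, int] = {}
_inverse: Dict[int, str] = {}


def toy_encode(text: str) -> List[int]:
    # Lowercases and splits on whitespace, as a preprocessing script might.
    ids = []
    for token in text.lower().split():
        if token not in _vocab:
            _vocab[token] = len(_vocab)
            _inverse[_vocab[token]] = token
        ids.append(_vocab[token])
    return ids


def toy_decode(ids: List[int]) -> str:
    # Joins tokens with spaces; it cannot restore the original casing.
    return " ".join(_inverse[i] for i in ids)


references = ["hello world", "Hello World"]
mismatches = find_round_trip_mismatches(references, toy_encode, toy_decode)
print(mismatches)
```

If many reference sentences show up in the mismatch list, the decoded hypotheses are being scored against text in a different form, which could depress BLEU even when the model itself is learning.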
Hi,
I am not sure whether the issue is with my settings. Could you please run your code on WMT and verify that it works?
I tried multiple times, and the BLEU score barely reaches 1. Thanks.