Open ArbinTimilsina opened 3 years ago
Pinging @patil-suraj too, and @mrm8488 might have played with that model in the past.
Any progress here? I've faced the exact same problem when attempting to translate from Spanish, although with slightly different output:
The Committee recommends that the State party take all necessary measures to ensure that the right to adequate housing is guaranteed in the State party's next periodic report, and that the State party take all necessary measures to ensure that the right to adequate housing is guaranteed in its next periodic report.
@patil-suraj - could you take a look here?
+1 I've been having the same issue translating from Spanish to English. Could someone take a look?
Environment info
transformers version: 4.9.1
Who can help
@patrickvonplaten
Information
I am seeing weird behavior with mBART-50 and Spanish. Please look at the code below:
The output is:
However, if I change the source language to French
tokenizer.src_lang = "fr_XX"
or any other language, I get the following output (which is what you expect):
This behavior is similar with other texts as well (e.g., "888"). Do you know why this behavior is unique to Spanish? Also, do you have any idea how to correct it?
Thanks!
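For anyone trying to reproduce this, here is a minimal sketch of the comparison described above: translate the same text with different `tokenizer.src_lang` values and look at the outputs side by side. It is not the original snippet from the report. The checkpoint name `facebook/mbart-large-50-many-to-many-mmt`, the `en_XX` target, and the helper names `make_mbart50_translator` and `compare_src_langs` are assumptions made for illustration.

```python
from typing import Callable, Dict, Iterable


def make_mbart50_translator(
    checkpoint: str = "facebook/mbart-large-50-many-to-many-mmt",  # assumed checkpoint
    tgt_lang: str = "en_XX",  # assumed target language
) -> Callable[[str, str], str]:
    """Build a translate(text, src_lang) function backed by an mBART-50 checkpoint."""
    # Imported lazily so compare_src_langs can be used without transformers installed.
    from transformers import MBart50TokenizerFast, MBartForConditionalGeneration

    tokenizer = MBart50TokenizerFast.from_pretrained(checkpoint)
    model = MBartForConditionalGeneration.from_pretrained(checkpoint)

    def translate(text: str, src_lang: str) -> str:
        # The source language code, e.g. "es_XX" or "fr_XX", is set on the tokenizer.
        tokenizer.src_lang = src_lang
        encoded = tokenizer(text, return_tensors="pt")
        # Force the first generated token to be the target language code.
        generated = model.generate(
            **encoded,
            forced_bos_token_id=tokenizer.convert_tokens_to_ids(tgt_lang),
        )
        return tokenizer.batch_decode(generated, skip_special_tokens=True)[0]

    return translate


def compare_src_langs(
    translate: Callable[[str, str], str],
    text: str,
    src_langs: Iterable[str],
) -> Dict[str, str]:
    """Translate the same text once per source language code and collect the outputs."""
    return {lang: translate(text, lang) for lang in src_langs}
```

Calling `compare_src_langs(make_mbart50_translator(), "888", ["es_XX", "fr_XX"])` downloads the checkpoint on first use and returns one output per source language, which makes it easy to see whether only `es_XX` misbehaves.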