Closed by DominiquePivard 7 months ago
Hello Dominique and Tommi
I have observed the following but have no idea why.
If the French sentence has no final punctuation or a full stop or even a colon at the end, it is translated correctly, for example
But if I put a semi-colon at the end, I can then reproduce Dominique's bug.
So it possibly has something to do with the semi-colon at the end, however improbable that may sound.
What do you think Tommi?
It's weird, but it is probably due to the training corpus containing many instances of extra spaces before and after apostrophes. The neural network has learned to associate certain trigger words or characters with the extra spaces, so they show up semi-randomly. You could call it overlearning by the model: it finds meaningful patterns in what are just pre-processing errors.
You can use a post-edit rule to fix these kinds of problems. In this case, just replacing " ' " with "'" would likely work, since I can't think of any legitimate case of an apostrophe occurring with spaces on both sides.
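For anyone who wants to apply that replacement outside the translation tool, here is a minimal sketch of the rule in Python. The function name `fix_spaced_apostrophes` is made up for illustration and is not part of any tool discussed here. It also assumes you want to catch the typographic apostrophe (’) and runs of more than one space, which goes slightly beyond the literal " ' " replacement suggested above.

```python
import re

# Hypothetical post-edit rule: an apostrophe (straight or typographic)
# with one or more spaces on BOTH sides. An apostrophe with a space on
# only one side is left alone, since that can be legitimate.
_SPACED_APOSTROPHE = re.compile(r" +(['’]) +")


def fix_spaced_apostrophes(text: str) -> str:
    """Remove the spaces surrounding an apostrophe, keeping the apostrophe itself."""
    return _SPACED_APOSTROPHE.sub(r"\1", text)


if __name__ == "__main__":
    segment = (
        "Aucun effet d ' explosion ou de projection d ' incendie "
        "n ' a été observé dans les batteries au cours des essais;"
    )
    print(fix_spaced_apostrophes(segment))
```

Because the pattern requires spaces on both sides, normal text such as "l'incendie" or a trailing quote followed by a space passes through unchanged.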
The following EN segment: No explosive or fire projection effects were observed from the batteries during testing; translates in FR as: Aucun effet d ' explosion ou de projection d ' incendie n ' a été observé dans les batteries au cours des essais;
Any idea why there’s an extra space before and after each apostrophe? It seems to be an isolated case, as I’ve seen many other segments in the same document with apostrophes in FR, but no extraneous spaces.