marian-nmt / marian

Fast Neural Machine Translation in C++
https://marian-nmt.github.io
Other
1.21k stars 227 forks source link

Why are only one or two first sentences translated? #389

Open molokanov50 opened 2 years ago

molokanov50 commented 2 years ago

Hello team,

I'm just trying to translate long texts consisting of 6 - 8 sentences but not exceeding 100 tokens in general (in order to not overcome model's memory consumption) using pretrained Helsinki-NLP Opus models. My input file is composed of every such text per separate line. According to the recommended workflow, I initially run preprocess.sh. The preprocessed texts contain all the terms present in the original texts. Then I perform translation with marian-decoder. But my marian-decoder output turns out to contain translations of only one or two initial sentences of each original text. Why does it take place? Is there any workaround?