marian-nmt / marian

Fast Neural Machine Translation in C++
https://marian-nmt.github.io

How to deal with copied words in source sentences #267

Open lkluo opened 5 years ago

lkluo commented 5 years ago

I am sorry that this issue is not directly related to the project.

In MT, some words/phrases, such as person names and company names, are not translated but copied verbatim from the source sentence. It occurs to me that there could be two approaches:

I tried Google Translate and other translation apps, and found that, to some extent, their systems can identify the words/phrases that should be copied, though not perfectly. Could someone advise what, in general, the best solution to this problem is? Thanks.
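
To make the question concrete, here is a minimal sketch of one commonly used workaround: masking known names with placeholders before translation and restoring them afterwards. It is only an illustration and not necessarily one of the approaches meant above. The function names `mask_entities` and `unmask_entities` and the `__ENTn__` tag format are hypothetical, and the sketch assumes the entity list is already available (e.g. from a NER tool or a glossary) and that the MT system passes the placeholder tokens through unchanged.

```python
def mask_entities(sentence, entities):
    """Replace each known entity with a numbered placeholder.

    `entities` is assumed to come from elsewhere (NER, glossary, ...).
    Returns the masked sentence and a placeholder -> entity mapping.
    """
    mapping = {}
    masked = sentence
    # Process longer entities first so that a short entity contained
    # in a longer one does not break the longer match.
    for i, ent in enumerate(sorted(entities, key=len, reverse=True)):
        if ent in masked:
            tag = f"__ENT{i}__"  # hypothetical placeholder format
            masked = masked.replace(ent, tag)
            mapping[tag] = ent
    return masked, mapping


def unmask_entities(translation, mapping):
    """Put the original entities back into the translated sentence."""
    for tag, ent in mapping.items():
        translation = translation.replace(tag, ent)
    return translation


if __name__ == "__main__":
    src = "Tim Cook works at Apple Inc."
    masked, mapping = mask_entities(src, ["Tim Cook", "Apple Inc."])
    print(masked)
    # Stand-in for the real MT output; assumes placeholders survive.
    fake_translation = masked.replace("works at", "arbeitet bei")
    print(unmask_entities(fake_translation, mapping))
```

This only works if the placeholders appear in the training data (or are otherwise protected from subword splitting), which is part of why the question of the best general solution is still open.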