marian-nmt / marian

Fast Neural Machine Translation in C++
https://marian-nmt.github.io
Other
1.21k stars 227 forks source link

negative cost with negative weights #404

Open tomsbergmanis opened 1 year ago

tomsbergmanis commented 1 year ago

The documentation says that word weights can be real-valued. Real numbers can be negative. This is convenient because outputs of models one could use for scoring often output log-likelihoods, which are negative numbers. image However, using negative weights results in a negative cost value, which, if minimized, maximizes the error: image

Cheers, Toms