marian-nmt / marian-dev

Fast Neural Machine Translation in C++ - development repository
https://marian-nmt.github.io
Other
255 stars 125 forks source link

Marian forgets best ce/bleu/perplexity after restarting from a checkpoint #180

Closed afaji closed 6 years ago

afaji commented 6 years ago

this impact early stopping criteria.

emjotde commented 6 years ago

We are already working on that (issue #64 is getting some attention now). Including saving Adam statistics. This should also save the state of the scheduler and current best validation results, number of stalls.

Later this should also try to resume in roughly the same area in the training corpus. Probably only for the SQLite container.

snukky commented 6 years ago

I store the following scheduler parameters:

Do we need something else?

emjotde commented 6 years ago

Just as a reminder, the validation scores a not correctly recorded which messes up early stopping. Otherwise it seems to run fine.

snukky commented 6 years ago

Should be fixed now.

emjotde commented 6 years ago

How about the direction of validation, i.e. increasing/decreasing scores are better?