marian-nmt / marian

Fast Neural Machine Translation in C++
https://marian-nmt.github.io
Other
1.21k stars 227 forks source link

Doc error in the explanation of --lr-decay-inv-sqrt? #397

Open EtienneAb3d opened 1 year ago

EtienneAb3d commented 1 year ago

Hi all, In the documentation, I read: --lr-decay-inv-sqrt: learning rate will be decreased at n / sqrt(no. updates) starting at n-th update

Trying to understand what kind of value I should set for this parameter, I finally conclude it should rather be: lr-init * sqrt(n / no. updates)

Right?