gaopengcuhk / Stable-Pix2Seq

A full-fledged version of Pix2Seq
Apache License 2.0
235 stars, 20 forks

About the learning rate settings #10

Open youngsheen opened 2 years ago

youngsheen commented 2 years ago

In the original paper, the learning rate was set to 3e-3 and the weight decay to 5e-2. Why does the code use a learning rate of 1e-5 and a weight decay of 1e-4? Also, could you share the NLL_Loss value once the model converges, just for reference? Thanks!