Open yaroslavb1 opened 6 years ago
@yaroslavb1 could you please provide code for what you mean here?
@tyokota
# smooth the loss value and bias correct
running_loss = self.loss_smoothing_beta * self.running_loss_ + (1. - self.loss_smoothing_beta) * loss
self.running_loss_ = running_loss
running_loss = running_loss / (1. - self.loss_smoothing_beta ** self.current_batch_)
The LR smoothing in line 336 isn't done right. running_loss = self.loss_smoothing_beta loss + (1. - self.loss_smoothing_beta) loss The first loss should be self.runningloss, and self.runningloss should be updated right after.