Closed pangjieyu closed 3 years ago
I noticed that the last_weight_accumulator in the train function is not used. Will this cause the network training speed to drop?
you can ignore it, it was for some older experiments.
I noticed that the last_weight_accumulator in the train function is not used. Will this cause the network training speed to drop?