Open Flova opened 1 year ago
Hi, regarding what you mentioned (linear instead of step-wise decay): I have reviewed your code and found that there is only one learning rate decay strategy, which adjusts the rate according to steps. How can I perform linear decay or other decay methods? As for using SGD, I think you are right for most training runs, and I succeeded when using SGD to train on BDD100K.
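For reference, the difference between the two strategies discussed here can be sketched in plain Python. This is only an illustration, not this repository's scheduler: the function names `linear_decay_lr` and `step_decay_lr`, and the parameters `total_steps`, `final_lr`, `milestones`, and `gamma`, are hypothetical and chosen for the example.

```python
def linear_decay_lr(step, total_steps, base_lr, final_lr=0.0):
    """Linearly interpolate from base_lr (step 0) to final_lr (total_steps).

    Hypothetical helper for illustration; not taken from the repository.
    """
    if total_steps <= 0:
        raise ValueError("total_steps must be positive")
    # Clamp the step so the rate stays at final_lr once training is over.
    progress = min(max(step, 0), total_steps) / total_steps
    return base_lr + (final_lr - base_lr) * progress


def step_decay_lr(step, base_lr, milestones, gamma=0.1):
    """Multiply base_lr by gamma once for every milestone already reached."""
    passed = sum(1 for m in milestones if step >= m)
    return base_lr * gamma ** passed


if __name__ == "__main__":
    # Compare both schedules at a few points of a 100-step run.
    for s in (0, 25, 50, 75, 100):
        print(s, linear_decay_lr(s, 100, 0.1), step_decay_lr(s, 0.1, [30, 60]))
```

If the training code is built on PyTorch (an assumption), a function like this could be adapted into a multiplier and passed to `torch.optim.lr_scheduler.LambdaLR`, which scales the optimizer's initial learning rate by the value the function returns.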
Proposed changes
Related issues
818
820
Necessary checks