Closed killwy closed 2 years ago
In your paper,you mentioned that "we switch the activation function to Mish and prolong the pre-training process in the SceneFlow dataset for another 15 epochs".So,should the learning rate change in another 15 epochs?
Hello, for the prolonging 15 epochs, the learning rate is start at 0.001 and downscaled by 2 at epoch 10.
ok,thanks for your instant reply
In your paper,you mentioned that "we switch the activation function to Mish and prolong the pre-training process in the SceneFlow dataset for another 15 epochs".So,should the learning rate change in another 15 epochs?