About learning rate setting

bryanyzhu / two-stream-pytorch

PyTorch implementation of two-stream networks for video action recognition

MIT License

568 stars 150 forks source link

About learning rate setting #22

Closed bityangke closed 6 years ago

bityangke commented 6 years ago

Hi Yi, How did you decide the lr step ? Did you follow somewhere else or experiment it youself ? Thanks in advance!

bryanyzhu commented 6 years ago

Hi, I decided the lr step according to this paper. But usually in my experiments, I just see when the loss/accuracy saturate, and then decay the lr. I find it more effective.

bityangke commented 6 years ago

Thanks very much.

bityangke commented 6 years ago

Hi Yi I noted that Yuanjun used a batchsize（16x4，16 samples on each card） and iter size 4， so the “batch size” is equal to 256. They used step size 4000, 8000, 10000 （iters） , the corresponding batch steps should be 1000，2000，2500， and epoch number should be about 27，54， 67. Am I right？ Thanks！

bryanyzhu commented 6 years ago

Hi, I am so sorry for the slow response, I am at CVPR this week.

I am also using similar strategy iter_size at here. So my corresponding batch steps are still 4000, 8000 and 10000, which is about 100, 200 and 250. Hope this is clear.

bityangke commented 6 years ago

Thank you！ Hope you have happy CVPR days！