Closed b03505036 closed 5 years ago
Hi Ken
Regarding learning-rate scheduling: yes, you should apply the linear learning-rate rule according to the changed batch size. However, in my experiments, where I resized the input and changed the total number of iterations for fast prototyping, the linear learning-rate rule led to divergence for the FSAF model. To remedy this, I tried the default learning rate instead, and it produced stable training. Thus, I used the default learning rate for both the baseline and FSAF.
Additionally, for the original setting, where 8 GPUs and a larger input size are used, linear learning-rate scaling might *not* lead to divergence. But I didn't check this due to my limited resources.
Hi, thanks for sharing, it's very helpful. I have a question about the learning rate. I saw your GPU number is 4 and num_images_per_GPU is 8, with a learning rate of 0.01. But according to the original mmdetection setting (8 GPUs, 2 images per GPU, lr = 0.01), shouldn't it be 0.01 * 2?
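For reference, the arithmetic behind the question above can be sketched as follows. This is a minimal illustration of the linear scaling rule using only the numbers mentioned in this thread; `linear_scaled_lr` is a hypothetical helper, not an mmdetection function:

```python
def linear_scaled_lr(base_lr, base_batch_size, new_batch_size):
    """Linear scaling rule: scale the reference learning rate
    proportionally to the change in total batch size."""
    return base_lr * new_batch_size / base_batch_size

# mmdetection reference setting: 8 GPUs x 2 images/GPU = batch 16, lr 0.01
# setting discussed here:        4 GPUs x 8 images/GPU = batch 32
print(linear_scaled_lr(0.01, 8 * 2, 4 * 8))  # -> 0.02
```

So with twice the total batch size, the rule would suggest lr = 0.02, which is the 0.01 * 2 the question refers to; the author's point above is that this scaled value diverged in their smaller-input setup, so they kept 0.01.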