Here you assign NUM_GPUS and original_num_gpus to be the same value, so why do you expect them to change in the "Adaptive changes"?
Also, there are "62723" images (MAX_ITER = 62723), now if you use 4 GPUs then it should come down to 62723/4 ~ 15680. But the MAX_ITER doesn't change during the training, it remains 62727 because the "step_scale" doesn't change from 1 to 1/4, it remains 1, no matter how many GPUs we use.
So, is there any problem with this logic or am I missing something? Can you please explain?
Also, did you train for only one epoch (i.e. 62723 images) or many epochs of 62723 images per epoch?
` ### Adaptively adjust some configs ###
Here you assign NUM_GPUS and original_num_gpus to be the same value, so why do you expect them to change in the "Adaptive changes"? Also, there are "62723" images (MAX_ITER = 62723), now if you use 4 GPUs then it should come down to 62723/4 ~ 15680. But the MAX_ITER doesn't change during the training, it remains 62727 because the "step_scale" doesn't change from 1 to 1/4, it remains 1, no matter how many GPUs we use.
So, is there any problem with this logic or am I missing something? Can you please explain? Also, did you train for only one epoch (i.e. 62723 images) or many epochs of 62723 images per epoch?
Thank you.