Here you assign NUM_GPUS and original_num_gpus the same value, so why do you expect them to differ in the "Adaptively adjust some configs" section?
Also, there are 62723 images (MAX_ITER = 62723), so with 4 GPUs MAX_ITER should come down to 62723/4 ≈ 15680. But MAX_ITER doesn't change during training: it stays at 62723 because "step_scale" never goes from 1 to 1/4. It stays at 1 no matter how many GPUs we use.
So, is there any problem with this logic or am I missing something? Can you please explain?
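To make the arithmetic concrete, here is a minimal sketch of what I expect versus what I observe. The function name, its parameters, and the rounding are my own assumptions for illustration, not the repository's actual code. It only shows that if both GPU counts are read from the same value, the scale is always 1.

```python
def scaled_max_iter(max_iter, original_num_gpus, num_gpus, ims_per_batch=1):
    """Hypothetical helper: scale the iteration count by the batch-size ratio."""
    # Batch size the config was written for (assumed: GPUs * images per GPU).
    original_batch_size = original_num_gpus * ims_per_batch
    # Batch size actually used at run time.
    effective_batch_size = num_gpus * ims_per_batch
    # A larger effective batch should need proportionally fewer steps.
    step_scale = original_batch_size / effective_batch_size
    # Round to the nearest integer number of iterations.
    return step_scale, int(max_iter * step_scale + 0.5)


# What I observe: both counts come from the same value, so nothing is scaled.
observed = scaled_max_iter(62723, original_num_gpus=4, num_gpus=4)

# What I expect: the config assumes 1 GPU and the run uses 4.
expected = scaled_max_iter(62723, original_num_gpus=1, num_gpus=4)

print(observed)  # (1.0, 62723)
print(expected)  # (0.25, 15681)
```

In this sketch the scaling only takes effect when original_num_gpus is captured before NUM_GPUS is overwritten with the real device count.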
`### Adaptively adjust some configs ###`
Thank you.