Codes for our paper "Progressive Differentiable Architecture Search:Bridging the Depth Gap between Search and Evaluation"
Other
360
stars
83
forks
source link
why the learning rate and batchsize in "train_imagenet.py" are different from what said in the paper? Which one can reproduce the result?And is the learning rate related to batchsize ? #14
Sorry for the late reply.
Yes, you should adjust the learning rate according to batch size. The learning rate in train_imagenet.py performs similarly to the one in the paper according to our experiments.
Sorry for the late reply. Yes, you should adjust the learning rate according to batch size. The learning rate in
train_imagenet.py
performs similarly to the one in the paper according to our experiments.