prlz77 / ResNeXt.pytorch

Reproduces ResNet-V3 with pytorch
MIT License
509 stars 119 forks source link

About initial learning rate #15

Closed lld533 closed 5 years ago

lld533 commented 5 years ago

Hi,

May I know what's the initial learning rate used in Cifar10 and Cifar100 experiments (-b 128 on 2 GPU cards)? The default value 0.1 or the sample value 0.05? Many thanks in advance!

prlz77 commented 5 years ago

Hi!

Following the original LUA implementation, for BS=128, the learning rate should be 0.1, since they use lr=0.05 for BS=64 and 0.025 for BS=32.

lld533 commented 5 years ago

Hi! Following the original LUA implementation, for BS=128, the learning rate should be 0.1, since they use lr=0.05 for BS=64 and 0.025 for BS=32.

Thanks!