What is the base learning rate for batch size 512?

facebookresearch / vicreg

VICReg official code base

MIT License

523 stars 88 forks source link

What is the base learning rate for batch size 512? #23

Open cneyang opened 1 year ago

cneyang commented 1 year ago

The original paper states that the base lr is 0.4.

However, 8-GPU single node training script says 0.3.

python -m torch.distributed.launch --nproc_per_node=8 main_vicreg.py --data-dir /path/to/imagenet/ --exp-dir /path/to/experiment/ --arch resnet50 --epochs 100 --batch-size 512 --base-lr 0.3

Which one is correct?