Open cneyang opened 1 year ago
The original paper states that the base lr is 0.4.
However, 8-GPU single node training script says 0.3.
python -m torch.distributed.launch --nproc_per_node=8 main_vicreg.py --data-dir /path/to/imagenet/ --exp-dir /path/to/experiment/ --arch resnet50 --epochs 100 --batch-size 512 --base-lr 0.3
Which one is correct?
The original paper states that the base lr is 0.4.
However, 8-GPU single node training script says 0.3.
Which one is correct?