Open risingClouds opened 9 months ago
the log as follow:
I have the exact same issue. I don't seem to understand why loss does not drop.
How many epochs did you train it for?
The problem was solved. This happened because I was not using copy.deepcopy to define the student and teacher in the beginning.
Maybe you can use the SGD optimizer. I'm trying to follow this training command from the author on a single GPU.
python main_dino.py --arch resnet50 --optimizer sgd --lr 0.03 --weight_decay 1e-4 --weight_decay_end 1e-4 --global_crops_scale 0.14 1 --local_crops_scale 0.05 0.14 --data_path /path/to/dataset/train --output_dir /path/to/saving_dir
When I try to trian resnet50 by dino on on X-Ray dataset with 1000 images, the loss is not drop, even increase some time. Have anyone met the same issue and solve it. the config as follow:(at first, I make all the hyper parmeter default, but the loss is not decrease)