Closed ye-yechen closed 3 years ago
Hi @ye-yechen
Make sure that the repository used in --dump_path
for eval_linear.py
does not have a checkpoint.pth.tar
file.
You should use a different dump_path
for pretraining and evaluation. In your case, the code is trying to re-start the evaluation from checkpoint.pth.tar
which is the checkpoint of pretraining, not evaluation.
ok,so I commented the function restart_from_checkpoint
in eval_linear.py
and I can run the codes.
But, if I want to train the model with the cifar-10 dataset,what parameters are important ? because I get a low accuracy in cifar-10.Thanks.
Good.
I have no experience running models on CIFAR-10.
I don't quite understand how to solve this problem. Can you explain it more clearly? thank you
Hi, thanks for your excellent work! I meet some problems when I run the codes. Firstly,I train the swav model with the command
python -m torch.distributed.launch --nproc_per_node=2 main_swav.py ...
,and the model parameters saved in the checkpoint.pth.tar. But when I run theeval_linear.py
with the pretrained swav model with the commandpython -m torch.distributed.launch --nproc_per_node=2 eval_linear.py --pretrained checkpoint.pth.tar
,I meet some errors,the logs are:Does it means that there are some errors when the optimizer restore from the checkpoints? Could you help me,thanks!