Open Sundragon1993 opened 3 years ago
Can you tell us which teacher model you used?
Thank you for your reply!
I've tried the 51M_Deeplab_all.json and 51M_Deeplab_incremental.json config file and in addition to this error, the validation procedure after a certain of val_interval was also not working, the error was: the loss (hint loss) has no property backward.
Seems the loss has become a float number rather than a tensor after the validation process...
Here is the full traceback:
There is no update ...
Traceback (most recent call last):
File "train.py", line 92, in
Thanks so much!
First of all, thank you so much for the valuable contribution to the community,
I followed your instructions and successfully train the network with 51M_deeplab_all.json config, everything was fine for several first epochs:
Since then, the losses of student model have somehow drastically surged to an irrational number:
Do you have any idea about this kind of error? I'm using torch: 1.8.0+cu111 and torchvision 0.9.0+cu111. Thank you very much!