Closed: sungsooo closed this issue 4 years ago
Please check out this branch: d1ec858edc25e2671e9a15d5fda4628b9fdbf48b. It should fix the NaN problem.
Thank you for your reply! I would like to understand the difference between the two branches. Could you explain why the NaN problem occurs on the master branch? Thank you!
We used a numerically unstable form of the KL divergence. We will fix this bug soon. In the meantime, the old branch also gives promising results for distillation.
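For readers who hit the same NaN, here is a minimal sketch of what "unstable" versus "stable" can mean for a KL divergence over logits. This is not the code from this repository, and the thread does not say which form was used. The function names (`log_softmax`, `kl_naive`, `kl_stable`) are hypothetical and only the Python standard library is assumed. The stable version stays in log space and uses the log-sum-exp trick, so no probability ever underflows to zero before a division or a logarithm.

```python
import math

def log_softmax(logits):
    """Log-softmax using the log-sum-exp trick (subtract the max first)."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_z for x in logits]

def kl_naive(teacher_logits, student_logits):
    """KL(teacher || student) from explicit probabilities.

    Hypothetical illustration of an unstable form: a student probability
    can underflow to exactly 0.0, and the ratio p / q then blows up.
    """
    def softmax(logits):
        exps = [math.exp(x) for x in logits]
        total = sum(exps)
        return [e / total for e in exps]

    p = softmax(teacher_logits)
    q = softmax(student_logits)
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q))

def kl_stable(teacher_logits, student_logits):
    """KL(teacher || student) computed entirely from log-probabilities."""
    log_p = log_softmax(teacher_logits)
    log_q = log_softmax(student_logits)
    # p * (log p - log q); exp(log p) is safe because log p <= 0.
    return sum(math.exp(lp) * (lp - lq) for lp, lq in zip(log_p, log_q))

if __name__ == "__main__":
    teacher = [0.0, 0.0]
    student = [-800.0, 0.0]  # exp(-800) underflows to 0.0 in float64
    print(kl_stable(teacher, student))
    try:
        kl_naive(teacher, student)
    except ZeroDivisionError as err:
        print("naive form failed:", err)
```

In pure Python the naive form raises `ZeroDivisionError` on this input. In an array library the same computation would typically produce `inf` or `nan` silently, which is consistent with a loss turning NaN after several steps. In PyTorch, the analogous approach would be to combine `torch.nn.functional.log_softmax` with `torch.nn.functional.kl_div` rather than taking the log of a softmax output.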
Hello, has this fix been merged into the master branch?
Hi, I have a problem while training your project. I just cloned your repo and changed only the data path.
However, when I train with the PI + PA + HO losses, the loss becomes NaN after several steps. I used the teacher weights you provide and trained the student network from scratch. Could you advise how to reproduce the results in your paper? If you don't mind, could you also share your training log file?
Thank you.