Open parquets opened 4 years ago
Hi @parquets, FYI, they apply the softmax when they compute the loss function. See the KD loss here: https://github.com/peterliht/knowledge-distillation-pytorch/blob/master/model/net.py#L110-L112
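To illustrate the point, here is a minimal, library-free sketch of a distillation loss in which the softmax lives inside the loss, so the networks only need to return raw logits. It is not the repository's code: the names `log_softmax` and `kd_loss`, the default values of `alpha` and `T`, and the sum-over-classes KL reduction are all assumptions chosen for illustration. In PyTorch the same idea would typically be written with `F.log_softmax` and `F.softmax`.

```python
import math


def log_softmax(logits, T=1.0):
    """Numerically stable log-softmax of temperature-scaled logits."""
    scaled = [z / T for z in logits]
    m = max(scaled)
    log_sum_exp = m + math.log(sum(math.exp(z - m) for z in scaled))
    return [z - log_sum_exp for z in scaled]


def kd_loss(student_logits, teacher_logits, label, alpha=0.9, T=4.0):
    """Hypothetical distillation loss for one sample (illustrative only).

    Both inputs are raw logits; the softmax is applied here, inside the loss.
    """
    log_p_student = log_softmax(student_logits, T)
    log_p_teacher = log_softmax(teacher_logits, T)

    # KL(teacher || student) over the temperature-softened distributions.
    kl = sum(
        math.exp(lt) * (lt - ls)
        for lt, ls in zip(log_p_teacher, log_p_student)
    )

    # Standard cross-entropy against the hard label, at temperature 1.
    ce = -log_softmax(student_logits)[label]

    # Assumed weighting: soft term scaled by alpha * T^2, hard term by (1 - alpha).
    return alpha * T * T * kl + (1.0 - alpha) * ce


if __name__ == "__main__":
    # The model outputs below are plain logits, with no softmax layer applied.
    print(kd_loss([1.0, 2.0, 3.0], [0.5, 2.5, 4.0], label=2))
```

Because the loss normalises the logits itself, adding a softmax layer at the end of the model as well would apply the normalisation twice.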
thank you, I get it
@peterliht Could you share a pre-trained model for ResNet110?