ethz-asl / hfnet

From Coarse to Fine: Robust Hierarchical Localization at Large Scale with HF-Net (https://arxiv.org/abs/1812.03506)
MIT License
776 stars 185 forks source link

About the training process #55

Closed QsingHuan closed 3 years ago

QsingHuan commented 3 years ago

Dear Author, I've been confused about the training process that you calculate the detector loss using "dense_scores" from teacher model and "logits" from the student model. However, "dense_scores" is derived after a softmax function while "logits" is not. Why don't you use "prob_full" that is calculated in the local_head with softmax function which seems more reasonable? Are there any specific reasons? Looking forward to your reply.