HobbitLong / RepDistiller

[ICLR 2020] Contrastive Representation Distillation (CRD), and benchmark of recent knowledge distillation methods
BSD 2-Clause "Simplified" License
2.17k stars 395 forks source link

No dev set split #59

Closed guzy0324 closed 1 year ago

guzy0324 commented 1 year ago

I noticed the comments in the code, which state that the reported results are from the last epoch and that no dev set is used to find the best epoch. I also found other repositories that don't have a dev set split. I wonder if it's better to have a dev set to tune hyperparameters and find the best checkpoint, or if a dev set is unnecessary for distillation.