[ICLR 2020] Contrastive Representation Distillation (CRD), and benchmark of recent knowledge distillation methods
BSD 2-Clause "Simplified" License
2.11k
stars
389
forks
source link
How can I use CRD_loss to face landmark detetct for model compression? There is no "opt.nce_k: number of negatives paired with each positive". #22
Open
gjd2017 opened 4 years ago