Closed shenyehui closed 1 year ago
Is it the teacher model that becomes the distilled model after feature distillation, and then the distilled model becomes the fine-tuning model after fine-tuning?
Is it the teacher model that becomes the distilled model after feature distillation, and then the distilled model becomes the fine-tuning model after fine-tuning?