Closed LiangHann closed 3 years ago
We perform the evaluation on the student net: d_trainer(teacher_net, student_net) returns the parameters of the student net. We store the result in a variable called teacher_net because we tried iterating the process (namely, using the student as the teacher for another network). However, as we did not observe any improvement, we simply stopped at one iteration.
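To make the naming easier to follow, here is a minimal sketch of the same loop with the returned network given an explicit name. The classes `Net` and `DistillationTrainer`, the helper `run_generations`, and the creation of a fresh student per generation are hypothetical stand-ins, not the repository's actual code. The sketch also assumes that the value returned by the trainer is the student network object itself.

```python
# Hypothetical stand-ins: the real networks and d_trainer live in the repository.
class Net:
    def __init__(self, name):
        self.name = name
        self.is_teacher = False

    def teacher_mode(self):
        # Assumed meaning: mark this network as the teacher for the next generation.
        self.is_teacher = True


class DistillationTrainer:
    def __init__(self):
        self.evaluated = []  # names of the networks passed to evaluate()

    def __call__(self, teacher_net, student_net):
        # Distillation training would happen here; the trained student is returned.
        return student_net

    def evaluate(self, net):
        # Record which network is evaluated instead of computing real metrics.
        self.evaluated.append(net.name)


def run_generations(num_generations):
    d_trainer = DistillationTrainer()
    teacher_net = Net("teacher_0")
    for idx_iteration in range(num_generations):
        # Assumption: a new student is created for every generation.
        student_net = Net(f"student_{idx_iteration + 1}")
        trained_student = d_trainer(teacher_net, student_net)
        d_trainer.evaluate(trained_student)  # evaluation runs on the student
        trained_student.teacher_mode()
        teacher_net = trained_student  # the student becomes the next teacher
    return d_trainer.evaluated


print(run_generations(1))
```

With a single generation, the only network that reaches `evaluate` is the trained student, which matches the answer above: the name `teacher_net` only reflects its role in a possible next generation.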
I am confused about the evaluation code:

    for idx_iteration in range(args.num_generations):
        print(f'starting generation {idx_iteration+1}')
        print('#'*100)
        teacher_net = d_trainer(teacher_net, student_net)
        d_trainer.evaluate(teacher_net)
        teacher_net.teacher_mode()

Do you use the student network or the teacher network for evaluation?