Open zdgithub opened 4 years ago
Could the distillation networks be trained on multiple gpus?
That would be definitely worth trying. Don't have the bandwidth to work on that though
Could the distillation networks be trained on multiple gpus?