Closed deropty closed 3 years ago
Hi, I'm a freshman in the context of knowledge distillation. I wonder why there is no mutlple-gpu training in your code. What is the reason and is there any solution with this question? I'm very appereciate for any response, thank you!
Hi, I'm a freshman in the context of knowledge distillation. I wonder why there is no mutlple-gpu training in your code. What is the reason and is there any solution with this question? I'm very appereciate for any response, thank you!