Closed captain-pool closed 5 years ago
Have we added any README or documentation that references the paper we are using for this? And what changes we are making to the loss function, etc?
We have 2 options for Loss. We can choose any of these, depending on the output we are getting.
I've implemented both, we will be using each of them based on the output we get.
Adding Trainer for Knowledge Distillation.
Addresses #42, #45 , #46 , #47, #44