DehuaTang closed this issue 3 years ago
Yes, they're orthogonal techniques. In this work, we found that adaptive-KD alone already gives very promising results, so for the sake of simplicity we did not explore adding attentive sampling.
Nice work! Thank you for your reply. The Adaptive-KD loss works well in my task!
Hello! Did you use the Adaptive-KD loss with attentive sampling in your task?
Hello,
Thank you