JordanAsh / badge

An implementation of the BADGE batch active learning algorithm.
197 stars 32 forks source link

Different loss function #14

Open TobiArndt opened 1 year ago

TobiArndt commented 1 year ago

Hi! Very great work and wonderfull code. Thanks man! Have you tried non cross-entropy-based loss functions? I think off loss function in unsupervised methods for anomaly detection for filtering out anomly datapoints. One example for such a loss function is the differences between student and teacher intermediate layers. Am I right, that if it only depends on gradient, we could skip the hallucination-part and use the derivate only? Have a nice day Tobi