Open DevinCheung opened 1 month ago
Thank you for your interest in our work! self.betabase in our code is applied to constrcut margins for balanced margin softmax, so it is unnecessary to update it. alpha[targetsx] * self.betabase is refer to ∆^{t}{y} in Eq. (3).
Thanks for the great work. I am confused with self.betabase in trainer.py. Does it refer to ∆^{t}_{y} in Eq. (3) in the main paper? Also it seems that it is initialized but not updated during training. Coud you please help explain this? Thanks!