Closed adricarda closed 4 years ago
Hi, thanks for the code! I was wondering why in stage 2 the loss weight lambda-kl-target is set to 0. From my understanding, the rectification is done by using this term. Thanks in advance.
The lambda-kl-target term is 0 because it is only used in stage 1 when minimizing the discrepancy between the two classifiers.
Hi, thanks for the code! I was wondering why in stage 2 the loss weight lambda-kl-target is set to 0. From my understanding, the rectification is done by using this term. Thanks in advance.