I am studying your UDA algorithm and I am trying to adapt it to my specific problem.
My question is the following:
You seem to subtract the fair diversity loss from the entropy loss. Is this intentional? I've re-implemented your code from scratch and it seems that when i also do the same it I can reproduce your results
Initially i've implemented it using the KL divergence from the second equation of your paper and I wouldn't converge to your results.
Hi :),
I am studying your UDA algorithm and I am trying to adapt it to my specific problem.
My question is the following:
Initially i've implemented it using the KL divergence from the second equation of your paper and I wouldn't converge to your results.
Best Regards, Antonios Lykourinas