Hi, Xiankai,
Thanks very much for sharing your code!
In your paper, you mention that you add an orthogonal regularization term to your loss function (Equation 12). However, in your training code, I can only find the L1 loss between the prediction and the ground truth (details can be found here and here). Could you please explain how this L1 loss achieves the effect of orthogonal regularization?
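For reference, this is roughly what I expected to find in the training loss. It is only a minimal sketch in plain Python, assuming the regularizer takes the common form of a squared Frobenius norm of W Wᵀ − I added to the L1 term; I am not sure this matches Equation 12 exactly. The names `orthogonal_penalty`, `l1_loss`, `total_loss`, and the weight `lam` are hypothetical and do not come from your repository.

```python
def orthogonal_penalty(W):
    """Squared Frobenius norm of (W W^T - I), with W given as a list of rows."""
    n = len(W)
    penalty = 0.0
    for i in range(n):
        for j in range(n):
            # Entry (i, j) of the Gram matrix W W^T.
            gram_ij = sum(a * b for a, b in zip(W[i], W[j]))
            # Entry (i, j) of the identity matrix.
            target = 1.0 if i == j else 0.0
            penalty += (gram_ij - target) ** 2
    return penalty


def l1_loss(pred, target):
    """Mean absolute error between two equal-length sequences."""
    return sum(abs(p - t) for p, t in zip(pred, target)) / len(pred)


def total_loss(pred, target, W, lam=1e-3):
    """L1 data term plus a weighted orthogonality penalty (lam is an assumed weight)."""
    return l1_loss(pred, target) + lam * orthogonal_penalty(W)
```

With only the L1 term, nothing in the loss explicitly pushes W Wᵀ towards the identity, which is why I am asking.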
Thanks very much for your time!!!