Cycle-consistency loss function is using 'L1-norm'.
I think it is because G(F(x)) is very near to domain x.
x and x^ should be like this:
x [1.0, 1.0, 1.0, 0.0, 0.0, ...]
x^ [1.0, 1.0, 0.9, 0.1, 0.1, ...]
Since almost of all values should have same, the distance is near to zero. So I think L1-norm is useful than L2 norm.
Is this idea right?
Cycle-consistency loss function is using 'L1-norm'. I think it is because G(F(x)) is very near to domain x.
x and x^ should be like this: x [1.0, 1.0, 1.0, 0.0, 0.0, ...] x^ [1.0, 1.0, 0.9, 0.1, 0.1, ...] Since almost of all values should have same, the distance is near to zero. So I think L1-norm is useful than L2 norm. Is this idea right?