Open Yujun-Shi opened 5 years ago
I guess this is a simplification of the average Jacobian loss. Instead of using tf.reduce_mean(tf.reduce_sum(jacobian_loss)), the author uses only the Jacobian loss of the first sample in the batch, so that a single Jacobian approximates the batch-average Jacobian loss.
Computing the Jacobian loss for every sample in the batch and then averaging would take much more computation time.
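To make the two options concrete, here is a rough sketch of the quantities being compared. It assumes the penalty is a squared Frobenius norm of the per-sample input-output Jacobian $J(x_i)$, with batch size $B$; the exact form used in the paper and the repository may differ, and the symbols below are only illustrative.

$$
\mathcal{L}_{\text{first}} = \lVert J(x_1) \rVert_F^2,
\qquad
\mathcal{L}_{\text{avg}} = \frac{1}{B} \sum_{i=1}^{B} \lVert J(x_i) \rVert_F^2
$$

If the samples in a batch are drawn independently from the same distribution, both are unbiased estimates of the expected penalty, but $\mathcal{L}_{\text{first}}$ has higher variance because it uses one sample instead of $B$.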
Hi there, I added the following lines to the original code to print the value of the input-output Jacobian, and found that only the input-output Jacobian of the first image in a batch has a value during training. (My TensorFlow version is 1.7.0.)
Although the network's parameters are of course still regularized in this situation (with only one image per step used to regularize the Jacobian), this is not exactly consistent with the paper.
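For anyone who wants to see the difference numerically, here is a minimal pure-Python sketch. It is not the repository's code: the toy model `tanh(W x)`, the squared Frobenius norm penalty, and all function names are assumptions chosen only to illustrate "first image only" versus "average over the batch".

```python
import math

def forward(W, x):
    # Toy model (an assumption, not the repo's network): y = tanh(W x)
    return [math.tanh(sum(w * xi for w, xi in zip(row, x))) for row in W]

def jacobian(W, x):
    # Analytic input-output Jacobian of the toy model:
    # dy_k / dx_j = (1 - y_k^2) * W[k][j]
    y = forward(W, x)
    return [[(1.0 - yk * yk) * w for w in row] for yk, row in zip(y, W)]

def frob_sq(J):
    # Squared Frobenius norm (assumed form of the penalty)
    return sum(v * v for row in J for v in row)

def first_sample_penalty(W, batch):
    # What the issue describes: only the first image contributes
    return frob_sq(jacobian(W, batch[0]))

def batch_mean_penalty(W, batch):
    # Average over every image in the batch: one Jacobian per sample
    return sum(frob_sq(jacobian(W, x)) for x in batch) / len(batch)

W = [[0.5, -1.0], [1.5, 0.25]]
batch = [[0.0, 0.0], [1.0, 2.0], [-0.5, 0.3]]

print("first sample only:", first_sample_penalty(W, batch))
print("batch mean:       ", batch_mean_penalty(W, batch))
```

The two values differ whenever the samples in a batch have different Jacobians. The batch-mean version needs one Jacobian per sample, which is the extra cost mentioned in the reply.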