while the weight and bias are fixed at 1 and 0 respectively, the mean and var in it should be learned in the training process. Right?
There are some statements I need to confirm:
2. The authors state in the paper that the BN layer before DIF is not learned. Does that mean the mean and var are calculated every time for each different input?
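
To make the question concrete, here is a minimal sketch of the two behaviors I have in mind. It is plain Python, not the repository's code, and every name in it (`RunningStatsBN`, `per_input_bn`, `momentum`, `eps`) is my own hypothetical choice. It also simplifies a real BN layer by working on a flat list of values and using the biased variance everywhere.

```python
import math


def batch_stats(xs):
    """Return the mean and biased variance of a list of floats."""
    mean = sum(xs) / len(xs)
    var = sum((x - mean) ** 2 for x in xs) / len(xs)
    return mean, var


def normalize(xs, mean, var, eps=1e-5):
    # Weight and bias are fixed at 1 and 0, so there is no affine step.
    return [(x - mean) / math.sqrt(var + eps) for x in xs]


class RunningStatsBN:
    """Option A: mean/var are accumulated during training and reused later."""

    def __init__(self, momentum=0.1):
        self.momentum = momentum
        self.running_mean = 0.0
        self.running_var = 1.0

    def train_step(self, xs):
        # Normalize with the current batch statistics and update the
        # running estimates with an exponential moving average.
        mean, var = batch_stats(xs)
        m = self.momentum
        self.running_mean = (1 - m) * self.running_mean + m * mean
        self.running_var = (1 - m) * self.running_var + m * var
        return normalize(xs, mean, var)

    def infer(self, xs):
        # At inference the stored statistics are used, not the input's own.
        return normalize(xs, self.running_mean, self.running_var)


def per_input_bn(xs):
    """Option B: mean/var are recomputed from every input, nothing is stored."""
    mean, var = batch_stats(xs)
    return normalize(xs, mean, var)
```

In option A the statistics are not learned by gradient descent either, they are only accumulated as running averages during training. Option B is what I understand "calculated every time for different input" to mean. Which of the two matches the BN layer before DIF?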