Does anybody know why using 32 channels of noise for training, and the width and height of the target image also crop to an integer multiple of 32? Does it mean if we use three channels of noise as input the width and height of the target image need to crop to an integer multiple of 3? We can share more discussion and insight here about DIP.
Does anybody know why using 32 channels of noise for training, and the width and height of the target image also crop to an integer multiple of 32? Does it mean if we use three channels of noise as input the width and height of the target image need to crop to an integer multiple of 3? We can share more discussion and insight here about DIP.