Open masaisai111 opened 1 month ago
in fact, we only use center crop during training.
There are some differences between the photos obtained after clipping and the feature pictures extracted by clip, for example, the edge area is clipped off. Then why can the photos obtained after clipping be used for noise pictures? Won't there be some information conflict
what do you mean "clipping"
this,If the size of the training data I set is not square, the phenomenon of cropping will occur when training XL, and the edge information of the picture will be cropped,
Why crop the image with noise, but do not crop the reference image when extracting features, which will not result in feature mismatch
random crop