Closed MnLgt closed 2 months ago
the transform function of https://github.com/tencent-ailab/IP-Adapter/blob/main/tutorial_train.py#L43 is for VAE of SD. for clip, we use https://github.com/tencent-ailab/IP-Adapter/blob/main/tutorial_train.py#L49
Ah, of course. I was getting them mixed up. Thank you.
Hi,
I really love IP-Adapter! I'm wondering why you chose to normalize the image with 0.5
instead of the clip normalization of
The reason I ask is that I'm looking to train an IP-Adapter Plus using DinoV2 as the image encoder and I'm not sure whether to use the normalization used in the tutorial train plus of 0.5, the standard CLIP normalization or the DinoV2 normalization of
Thanks so much.