Hi, thanks for your great work. I'm currently using your pretrained DINO backbone (ViT-S/16) to extract features from 256x256 patches.
What mean and std should I use?
The ImageNet values, mean = (0.485, 0.456, 0.406) and std = (0.229, 0.224, 0.225)?
Or mean = (0.5, 0.5, 0.5) and std = (0.5, 0.5, 0.5)?
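To make the two options concrete, here is a minimal sketch of what each one does to a single RGB pixel already scaled to [0, 1]. The helper `normalize_pixel` is a hypothetical name I made up for illustration; it only applies the usual (x - mean) / std per channel and says nothing about which setting the pretrained weights expect.

```python
# The two candidate settings from the question (one value per RGB channel).
IMAGENET_MEAN = (0.485, 0.456, 0.406)
IMAGENET_STD = (0.229, 0.224, 0.225)
HALF_MEAN = (0.5, 0.5, 0.5)
HALF_STD = (0.5, 0.5, 0.5)


def normalize_pixel(rgb, mean, std):
    """Hypothetical helper: apply (x - mean) / std to each channel of one pixel."""
    return tuple((x - m) / s for x, m, s in zip(rgb, mean, std))


# A white pixel gets different values under each setting, so the choice
# changes the input the backbone actually sees.
white = (1.0, 1.0, 1.0)
print("ImageNet stats:", normalize_pixel(white, IMAGENET_MEAN, IMAGENET_STD))
print("0.5 stats:     ", normalize_pixel(white, HALF_MEAN, HALF_STD))
```

With the 0.5 setting the inputs land in [-1, 1], while the ImageNet setting gives a slightly different range per channel, so I assume the features depend on picking the one used during pretraining.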
Thanks