Visual-Behavior / aloception-oss

Aloception is a set of packages for computer vision: aloscene, alodataset, alonet.

RGB mean & std normalization values #302

Open tflahaul opened 1 year ago

tflahaul commented 1 year ago

The "resnet" normalization only works with the ImageNet RGB mean and std: ((0.485, 0.456, 0.406), (0.229, 0.224, 0.225)). Being able to specify the mean and std of other datasets at instantiation would be great.

Current workaround:

import torch
import aloscene
f = aloscene.Frame(torch.rand(1, 10, 10), names=('C', 'H', 'W'), normalization="resnet")
f.mean_std = ((0.5, 0.5, 0.5), (0.3, 0.3, 0.3))

Also, renaming the "resnet" normalization to "z-norm", "meanstd", or something similar would make it easier for everyone to understand.
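
For reference, here is a minimal sketch in plain PyTorch of what a dataset-specific mean/std ("z-norm") normalization amounts to. It does not use or describe the aloscene API: the helper names `channel_mean_std` and `z_normalize` are hypothetical, and the input is assumed to be a float tensor with values in [0, 1], laid out as (N, C, H, W) for a batch or (C, H, W) for a single image.

```python
import torch


def channel_mean_std(images: torch.Tensor):
    """Per-channel mean and std of a batch shaped (N, C, H, W).

    Hypothetical helper: shows how dataset-specific statistics
    could be obtained instead of reusing the ImageNet constants.
    """
    mean = images.mean(dim=(0, 2, 3))
    std = images.std(dim=(0, 2, 3))
    return mean, std


def z_normalize(image: torch.Tensor, mean, std) -> torch.Tensor:
    """Apply (x - mean) / std per channel.

    `mean` and `std` hold one value per channel; they are reshaped to
    (C, 1, 1) so they broadcast over both (C, H, W) and (N, C, H, W).
    """
    mean = torch.as_tensor(mean, dtype=image.dtype).view(-1, 1, 1)
    std = torch.as_tensor(std, dtype=image.dtype).view(-1, 1, 1)
    return (image - mean) / std


# Stand-in for a real dataset: 8 random RGB images of size 10x10.
batch = torch.rand(8, 3, 10, 10)
mean, std = channel_mean_std(batch)

# Normalize one image with the statistics computed above.
normalized = z_normalize(batch[0], mean, std)
```

With the ImageNet constants passed as `mean` and `std`, this computes the same per-channel formula that the "resnet" name refers to, which is why a name such as "z-norm" or "meanstd" would describe it more generally.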