Grayscale Optimal stimuli

Hello, I am trying to understand the working of Lucent a bit better with one of my models. My model is trained on grayscale images (grayscale version of natural images) but uses a VGG16 backbone for feature extraction. Therefore, it accepts 3-channel images just like other torchvision zoo models. When I run Lucent on certain units, it returns RGB optimal stimuli. I believe this is because VGG16 (pretrained) has its own implicit color processing filtering operations that the Lucent optimization framework leverages to return RGB optimal stimuli. However, given that I trained (finetuned) my model on grayscale images, I want to optimize for optimal stimuli in the same space. Is it possible to do this in Lucent? I tried a naive solution, i.e. adding the following transform: rgb2gray_tfo = lambda x: torch.tensordot(x[...,:3],torch.Tensor([0.2989, 0.5870, 0.1140]).cuda(),dims=1).unsqueeze(-1).expand_as(x) which should convert a RGB image to grayscale and repeat it in 3 channels to make the input suitable for passing to the network. However, the optimal Stimuli generated are just blank (gray) images. So, I am wondering if there's a solution to my problem. Thanks in advance. I must add that using Lucent for my project has been amazing so far.😄

greentfrapp / lucent

Grayscale Optimal stimuli #29