WenY2020 closed this issue 1 year ago.
I think I have the answer to my first question: "focalnet_base_lrf.pth" is one of the ImageNet-1K pretrained FocalNet-B models. I am still not sure about the second one...
Hi @WenY2020,
Thanks for your interest in our work. Regarding your two questions:
This checkpoint is hierarchical, which makes it slightly worse for visualization. I recommend also trying the isotropic FocalNets.
It stores the modulator for each focal modulation block so that you can visualize the magnitude of the modulation maps for all focal modulation layers.
Thanks, Jianwei
And if you just want to visualize your images, you can use this Hugging Face demo directly!
Thank you very much, Jianwei, this is very helpful. I will look into this line: x_out = q*self.modulator. Beyond the visualization, I mainly need the attention values/scores themselves, because I want to use them as features to feed into other models.
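For anyone with the same goal, here is a minimal sketch of turning one stored modulator into per-pixel scores. It is not the authors' visualization code. It assumes the modulator has already been read from a block and converted to a NumPy array of shape (C, H, W) (for a PyTorch tensor, something like `.detach().cpu().numpy()` on a single sample). The function name `modulation_score_map`, the L2 norm over channels, the min-max scaling, and the nearest-neighbour upsampling are all illustrative choices, and the random array is only a stand-in for a real modulator.

```python
import numpy as np


def modulation_score_map(modulator, out_hw):
    """Collapse a (C, H, W) modulator into an (out_h, out_w) map of scores in [0, 1].

    Hypothetical helper: the reduction and scaling are assumptions,
    not something prescribed by the FocalNet code.
    """
    modulator = np.asarray(modulator, dtype=np.float64)
    if modulator.ndim != 3:
        raise ValueError("expected a (C, H, W) array")

    # Per-location magnitude: L2 norm across the channel axis.
    magnitude = np.linalg.norm(modulator, axis=0)

    # Min-max scaling so the scores lie in [0, 1]; a constant map becomes all zeros.
    lo, hi = magnitude.min(), magnitude.max()
    if hi > lo:
        scores = (magnitude - lo) / (hi - lo)
    else:
        scores = np.zeros_like(magnitude)

    # Nearest-neighbour upsampling from the feature grid to the image grid.
    out_h, out_w = out_hw
    h, w = scores.shape
    rows = np.arange(out_h) * h // out_h
    cols = np.arange(out_w) * w // out_w
    return scores[np.ix_(rows, cols)]


# Random stand-in for one block's modulator (8 channels on a 7x7 grid).
rng = np.random.default_rng(0)
fake_modulator = rng.normal(size=(8, 7, 7))
scores = modulation_score_map(fake_modulator, (224, 224))
```

The resulting `scores` array has one value per input pixel, so it can be flattened or pooled and passed to another model as a feature. Whether the last block or some combination of blocks gives the most useful map is something to check on your own data.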
Awesome, let me know if you have any further questions.
Thank you, will do :).
Hello, Jianwei:
I am trying to use FocalNet to get attention/focus scores for my images. It performs well in the modulation maps of the images below, and this is exactly what I need: I want to calculate an attention/focus score for each pixel. In your images, yellow marks the highest attention scores (I am not sure "attention score" is the right name; basically, I want to see which pixels attract the most attention from people's eyes). I have two questions:
Thank you very much for your help in advance!
Best wishes, Wen