It seems the architectures are different, is any code part out of date here please, I am still in the visualization.ipynb to run.
Thank you!
And also about the model choices:
I see the visualization in huggingface you shared last time is quite good of the attentions visualisation, the model behind should be 'focalnet_base_iso_16.pth' right? as I checked the files there and if I understand correctly.
If my focus is to generate attention scores for all pixels in images, what models do you recommend, if pre-trained model: focalnet_base_iso_16.pth is best/good? or I'd better training one on my own data (mostly online advertisments images) based on some pre-trained-model? (It seems there is no way to evaluate the performance of attentions scores of the models except by eyes/intuitions.)
Hello @jwyang: I have another problem when I try the isotropic focalnets model of 'focalnet_base_iso_16.pth':
I initialize the model by
and load 'focalnet_base_iso_16.pth' by
but I have the error as below:
It seems the architectures are different, is any code part out of date here please, I am still in the visualization.ipynb to run. Thank you!
And also about the model choices:
I see the visualization in huggingface you shared last time is quite good of the attentions visualisation, the model behind should be 'focalnet_base_iso_16.pth' right? as I checked the files there and if I understand correctly.
If my focus is to generate attention scores for all pixels in images, what models do you recommend, if pre-trained model: focalnet_base_iso_16.pth is best/good? or I'd better training one on my own data (mostly online advertisments images) based on some pre-trained-model? (It seems there is no way to evaluate the performance of attentions scores of the models except by eyes/intuitions.)
Sorry a bit long questions... Thanks a lot!
Best wishes, Wen