XFeiF / ComputerVision_PaperNotes

📚 Paper Notes (Computer vision)
1 stars 0 forks source link

21ICCV # Emerging Properties in Self-Supervised Vision Transformers (DINO) #44

Open XFeiF opened 2 years ago

XFeiF commented 2 years ago

Paper
Code

Authors:
Mathilde Caron, Hugo Touvron, etc.
FBAI.

Highlights:

XFeiF commented 2 years ago

They look at the self-attention of the [CLS] token on the heads of the last layer. This token is not attached to any label nor supervision. These maps show that the model automatically learns class-specific features leading to unsupervised object segmentations.

SSL segmentation magic -> SSL + Transformer's [CLS] token.