mhamilton723 / STEGO

Unsupervised Semantic Segmentation by Distilling Feature Correspondences
MIT License

Question about Training Methodology in STEGO #92

Closed rd1886 closed 3 months ago

rd1886 commented 5 months ago

Hi,

I have a question about the training process in STEGO. In DINO, the training approach uses a fixed backbone and trains only the interchangeable head. Does the same principle apply to STEGO? Specifically, when training on a custom dataset, are the ViT-Base / ViT-Small backbones frozen, as the backbone is in DINO?
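
To make the question concrete, here is a minimal PyTorch sketch of what I mean by a "frozen" backbone. The `backbone` and `head` modules below are hypothetical stand-ins for illustration only, not STEGO's or DINO's actual classes, and the loss is a dummy:

```python
import torch
import torch.nn as nn

# Hypothetical stand-ins (assumptions): not STEGO's or DINO's real modules.
backbone = nn.Sequential(nn.Linear(16, 32), nn.ReLU())  # plays the role of the ViT
head = nn.Linear(32, 8)                                  # plays the role of the trainable head

# Freeze the backbone: disable gradients and switch to eval mode.
for p in backbone.parameters():
    p.requires_grad = False
backbone.eval()

# Only the head's parameters are handed to the optimizer.
optimizer = torch.optim.Adam(head.parameters(), lr=1e-3)

x = torch.randn(4, 16)
before = [p.clone() for p in backbone.parameters()]

# One training step: features are computed without building a graph.
with torch.no_grad():
    feats = backbone(x)
loss = head(feats).pow(2).mean()  # dummy loss, for illustration only
optimizer.zero_grad()
loss.backward()
optimizer.step()

# The backbone weights are identical before and after the step.
backbone_unchanged = all(
    torch.equal(a, b) for a, b in zip(before, backbone.parameters())
)
```

If STEGO follows this pattern, I would expect only the segmentation head's weights to change during training.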

Thanks!