To be clear, there is no training in our framework. During the onboarding and proposal stages, we use the pretrained models provided by SAM (or FastSAM) and DINOv2.
For SAM (or FastSAM), we use its default prompt-free setting, sampling 64 points per dimension. Over-segmentation is still a problem, but it can be filtered out in the matching stage, since over-segmented masks have lower confidence scores than correct segmentation masks. For example, the teaser shows the segmentation on the YCB-V dataset, where most of the over-segmented masks are filtered out.
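To make the two ideas above concrete, here is a minimal sketch, not the actual code from the repository. It assumes the point prompts are placed at cell centers of a regular grid in normalized image coordinates, and that each proposal carries a single matching score. The names `build_point_grid` and `filter_by_score`, the proposal dictionaries, and the 0.5 threshold are all hypothetical and chosen only for illustration.

```python
def build_point_grid(points_per_side):
    """Return evenly spaced (x, y) points in the unit square, one per grid cell center."""
    offset = 1.0 / (2 * points_per_side)
    coords = [offset + i / points_per_side for i in range(points_per_side)]
    return [(x, y) for y in coords for x in coords]


def filter_by_score(proposals, threshold):
    """Keep only the proposals whose score is at least `threshold`."""
    return [p for p in proposals if p["score"] >= threshold]


# 64 points per dimension gives 64 * 64 point prompts over the image.
grid = build_point_grid(64)

# Hypothetical proposals: one correct mask and two over-segmented fragments.
# The scores are made up for illustration, not measured results.
proposals = [
    {"id": "whole_object", "score": 0.82},
    {"id": "fragment_a", "score": 0.31},
    {"id": "fragment_b", "score": 0.27},
]

# Assumed threshold; the value used in practice may differ.
kept = filter_by_score(proposals, threshold=0.5)
```

In this sketch the fragments fall below the threshold and are dropped, which is the behavior described above for over-segmented masks in the matching stage.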
Hi, thanks for your nice work! I have some questions that I hope you can help me with: