HarborYuan / ovsam

[ECCV 2024] The official code of paper "Open-Vocabulary SAM".
https://www.mmlab-ntu.com/project/ovsam
Other
914 stars 27 forks source link

Something wrong with the Ilustration? #7

Closed ArsenLuca closed 8 months ago

ArsenLuca commented 8 months ago

image It seems like to be like this?

HarborYuan commented 8 months ago

Hi @ArsenLuca ,

Thanks for your interest in our work.

The direction of this arrow is based on how you understand the arrow. The technical details of the SAM2CLIP module follow the upper right part of this figure (figure 3 in the paper). SAM Encoder, CLIP Encoder, and a neck are needed in the SAM2CLIP process for distillation. The arrow here mainly describes the knowledge transfer direction (from SAM encoder to CLIP encoder + neck) during training. During the inference, the SAM Encoder is no longer needed.

I hope this can help you.

ArsenLuca commented 8 months ago

Hi @ArsenLuca ,

Thanks for your interest in our work.

The direction of this arrow is based on how you understand the arrow. The technical details of the SAM2CLIP module follow the upper right part of this figure (figure 3 in the paper). SAM Encoder, CLIP Encoder, and a neck are needed in the SAM2CLIP process for distillation. The arrow here mainly describes the knowledge transfer direction (from SAM encoder to CLIP encoder + neck) during training. During the inference, the SAM Encoder is no longer needed.

I hope this can help you.

Got it , thanks