Hello,
First, thank you for this paper and code!
I want to adapt you approach to use Supervised Contrastive Learning after a regional proposal network.
In this context, I was wondering, as it's not explicitly said in the paper: what are your exact motivations for including two views of the images in each batch?
From my understanding/intuition, it's to ensure that the anchor is exposed to a "decent"/minimum number of positive samples, is that right?
My understanding is that to make sure that there exist positives in the multiviewed batch, otherwise, the numerator in the loss formula (Eq. 2/3) would be 0.
Hello, First, thank you for this paper and code! I want to adapt you approach to use Supervised Contrastive Learning after a regional proposal network.
In this context, I was wondering, as it's not explicitly said in the paper: what are your exact motivations for including two views of the images in each batch? From my understanding/intuition, it's to ensure that the anchor is exposed to a "decent"/minimum number of positive samples, is that right?
Thank you!