Almost doubles the speed of training, no significant impact on results (checked for Flickr8k and Flickr30k).
Semantically, the only difference is that all possible contrastive pairs within a minibatch are used, where before it was ncon pairs, with default ncon = 100.
Almost doubles the speed of training, no significant impact on results (checked for Flickr8k and Flickr30k).
Semantically, the only difference is that all possible contrastive pairs within a minibatch are used, where before it was
ncon
pairs, with defaultncon = 100
.