Open mymuli opened 2 years ago
Sorry for getting you latter. A1: Yes, you can understand like that. A2: We can concat the two outputs of the classifier, and calculate the loss with the Ground Truth. Because the classifier is shared, we can thus directly concat the feature and duplicate the labels. Actually, we concat the feature between before BatchFormer and after BatchFormer.
By the way, for moco-v3, it is not necessary to use the two features according to our new experience.
really thanks for your reply,I understand it!thanks very much!
您好! 我想问一下: Q1: 在模型训练的时候,模型最终是几个输出?是 BatchFormer前的classifier + BatchFormer后的classifier 这两个输出呢?然后分别与ground truth计算呢?
Q2: 代码里面,如下所示,这行代码表示的是BatchFormer后的classifier的输出呢?还是对模型两个classifier的输出做concat操作后,最后再与ground truth计算呢? https://github.com/zhihou7/BatchFormer/blob/f6be150ff9fce2ce4bcc3c0f8953dded9d4f273f/moco-v3/moco/builder.py#L23