Closed ekorman closed 3 years ago
Hi,
As shown above, we extract multiple representations for evaluation:
g o f
): embedding after projection headf
): embedding before projection headAs mentioned in Section 3.1, we use f
as an embedding for test time unless otherwise specified as in Figure 4.
ah got it, thanks for the quick response!
If I understand the code correctly, then the experiments with projection heads (e.g.
script/run_contrastive_da.sh
) use the projection head at test time and not just training time. This is from looking atresnet_util.ResNet
which looks like the outputembeds
is always the projection head. So, in the notation of the paper (e.g. figure 4), the contrastive results generated by this repo are allContrastive (DA) g o f
So then in table 2 of the paper where it says
Contrastive (DA)
that refers to usingg o f`` since the script
script/run_contrastive_da.sh` is supposed to reproduce that row?That's a little confusing to me since table 4 shows that the best results are obtained without using the projection head at test time (i.e. in the second stage).
Is my understanding correct?