facebookresearch / omnivore

Omnivore: A Single Model for Many Visual Modalities
Other
559 stars 39 forks source link

EK100 top 1 acc question #40

Closed potatowarriors closed 1 year ago

potatowarriors commented 1 year ago

your work is cool !! I have a question while looking at the paper. There is a difference between the OMNIVORE EK100 top 1 acc in table 3 and the top 1 acc in table 6 at 47.4 vs 49.9. Are both models Swin-B models and pre-trained in the same way? Where does the 2.5% performance difference come from?

mannatsingh commented 1 year ago

Hi @potatowarriors the difference in performance is from the fact that for the results in Table 6 we additionally pretrain on ImageNet-21K.

potatowarriors commented 1 year ago

Thanks for the kind explanation, my question is solved! I'll close the issue