arthurdouillard / dytox

Dynamic Token Expansion with Continual Transformers, accepted at CVPR 2022
https://arxiv.org/abs/2111.11326
Apache License 2.0

Accuracy variation with the CLIP model #13

Closed vgthengane closed 2 years ago

vgthengane commented 2 years ago

I am trying to integrate the CLIP model with DyTox. With the default config, the evaluation accuracy for task=0, after epoch=0, with num_gpus=4 is 60.5%.

When I add the single line _clip_model, _clip_transform = clip.load("ViT-B/32", device=device), the accuracy jumps from 60.5% to 75.0%.

Similarly, with _clip_model, _clip_transform = clip.load("ViT-B/16", device=device), the accuracy is 73.5%.

I only instantiated the CLIP model above the scenario_train loop here and did not change anything else, as in the sketch below.
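
For reference, a minimal sketch of the change, assuming the placement described above (the scenario_train loop follows the naming in the DyTox training script; the loop body itself is left untouched):

```python
import torch
import clip  # OpenAI CLIP package (https://github.com/openai/CLIP)

device = "cuda" if torch.cuda.is_available() else "cpu"

# The single added line: CLIP is instantiated once, before the task loop.
# The returned model and transform are not used anywhere else in the script.
_clip_model, _clip_transform = clip.load("ViT-B/32", device=device)

# Everything below is the unchanged DyTox code, e.g.:
# for task_id, dataset_train in enumerate(scenario_train):
#     ...  # train and evaluate each task as before
```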

Do you have any idea what might be causing this issue?

Thanks, Vishal

arthurdouillard commented 2 years ago

Hum, I'm sorry but I don't understand what you're doing.

You said you added "_clip_model, _clip_transform = clip.load("ViT-B/32", device=device)", but what were you doing before then? Were you creating another ViT of a different size?