Open HypnosXC opened 2 years ago
Can you provide the few shot results of different few shot settings of the 11 dataset , with the vit-B image backbone CLIP? I tried the settings in the paper but some results can not be achieved (food , for instance )
I am wondering whether the result you get needs early stopping strategy? Or just the final output?
Can you provide the few shot results of different few shot settings of the 11 dataset , with the vit-B image backbone CLIP? I tried the settings in the paper but some results can not be achieved (food , for instance )