Open momo1986 opened 3 months ago
I think this is the expected result. See Figure 7 of the paper (https://arxiv.org/pdf/2310.13076): ViT14x2 is less robust to larger patches. In your experiment setting, the feature size is 7 and the mask size is 4. The certification condition of the double-masking algorithm is two-mask correctness, i.e., the model prediction must be correct for every possible pair of masks. In this case, two masks can remove 4*2=8 features, which is more than the entire feature map. So zero robustness is expected.
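To illustrate the counting argument, here is a minimal sketch. It is not code from the repository. It assumes a simplified 1-D feature map and masks that slide with stride 1, and the helper names `mask_positions` and `pair_can_cover_everything` are hypothetical. It only checks whether some pair of masks can hide every feature, in which case two-mask correctness cannot be expected to hold.

```python
from itertools import combinations_with_replacement


def mask_positions(feature_size, mask_size, stride=1):
    """Start indices of all masks that fit inside a 1-D feature map (assumed layout)."""
    return list(range(0, feature_size - mask_size + 1, stride))


def pair_can_cover_everything(feature_size, mask_size, stride=1):
    """Return True if some pair of masks hides every feature in the map."""
    starts = mask_positions(feature_size, mask_size, stride)
    everything = set(range(feature_size))
    for a, b in combinations_with_replacement(starts, 2):
        # Union of the feature indices hidden by the two masks.
        covered = set(range(a, a + mask_size)) | set(range(b, b + mask_size))
        if covered == everything:
            return True
    return False


# Setting from this issue: feature size 7, mask size 4.
print(pair_can_cover_everything(7, 4))  # True: masks starting at 0 and 3 hide all 7 features
# A smaller mask cannot hide everything with only two masks.
print(pair_can_cover_everything(7, 3))  # False: two masks hide at most 6 features
```

With feature size 7 and mask size 4, at least one mask pair leaves the model with no visible features, which is consistent with the zero certified robustness reported above.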
Hi Xiang,
I think you have done wonderful work.
I tried the program.
However, I ran into some issues.
I followed your instructions to set up the data and download the weights.
However, in the patchcure_vitsrf14x2_masked setting, the result is not what I expected:
~
Does this indicate a potential bug?
Thanks & Regards! Momo