yuranusduke / CALIP

Unofficial implementation of CALIP: Zero-Shot Enhancement of CLIP with Parameter-free Attention

About the result and some issues #1

Open SY-Xuan opened 2 years ago

SY-Xuan commented 2 years ago

Dear sir, thanks for your re-implementation. I have two questions: a) Does this code reproduce the experimental results reported in the original paper? b) I also have some questions about your implementation.

yuranusduke commented 2 years ago

Hello,

Thanks for your interest in the code. To answer your questions:

  1. I tried it on Caltech101 and ImageNet, and the results can be reproduced.
  2. I changed the code in the original CLIP because the paper leaves one point ambiguous: how to align the vector dimensions. I use simple average pooling; you can try other methods.
  3. The outputs from CLIP are normalized, but I could not find whether the authors apply normalization in the paper, so I didn't normalize. If you find out, I'd appreciate it if you posted your modification here.
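
For readers following the thread, here is a minimal sketch of what points 2 and 3 could look like in practice. It is not the repository's code: the helper names `avg_pool_channels` and `l2_normalize`, the use of NumPy instead of PyTorch, and the shapes (49 spatial positions, 2048 spatial channels, 1024-dimensional text features, 5 classes) are all assumptions chosen for illustration. It aligns dimensions by averaging adjacent channel groups, then shows the optional L2 normalization step that the reply says was left out.

```python
import numpy as np

def avg_pool_channels(feats, out_dim):
    """Shrink the last axis to out_dim by averaging equal-sized groups
    of adjacent channels (one simple way to align dimensions)."""
    in_dim = feats.shape[-1]
    if in_dim % out_dim != 0:
        raise ValueError("in_dim must be a multiple of out_dim")
    group = in_dim // out_dim
    return feats.reshape(*feats.shape[:-1], out_dim, group).mean(axis=-1)

def l2_normalize(x, eps=1e-12):
    """Scale each row to unit L2 norm (the step the reply left out)."""
    norm = np.linalg.norm(x, axis=-1, keepdims=True)
    return x / np.maximum(norm, eps)

# Hypothetical shapes, for illustration only.
rng = np.random.default_rng(0)
spatial = rng.standard_normal((49, 2048))            # spatial image features
text = l2_normalize(rng.standard_normal((5, 1024)))  # normalized text features

aligned = avg_pool_channels(spatial, text.shape[-1])  # (49, 1024)
aligned_normed = l2_normalize(aligned)                # optional normalization
sim = aligned_normed @ text.T                         # (49, 5) cosine similarities
```

With the normalization applied, every entry of `sim` is a cosine similarity in [-1, 1]; without it, the scale of the similarities depends on the feature magnitudes, which may be why the choice affects the results.
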

SY-Xuan commented 2 years ago

Thanks for your reply.