janghyuncho / DECOLA

Code release for "Language-conditioned Detection Transformer"
https://arxiv.org/abs/2311.17902

Classifier Weights #11

Open hellerguyh opened 3 months ago

hellerguyh commented 3 months ago

Hey,

Thank you for the fascinating and excellent work.

I have a question regarding the code in your repository. I noticed that the classifier weights are loaded from existing files, such as datasets/metadata/lvis_v1_clip_a+cname.npy. Does this mean that these weights were optimized as part of the training procedure, or are they equivalent to the CLIP embeddings of the corresponding texts?

Thanks

janghyuncho commented 2 months ago

Sorry for the late reply! They are just CLIP embeddings of the class-name texts (never trained).
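
For readers wondering what "never trained" means in practice, here is a minimal sketch of how a fixed embedding file can serve as a classifier: the rows are loaded once, L2-normalized, and compared to region features by cosine similarity, with no gradient updates. This is an illustration, not the repository's actual code. The function names `load_frozen_classifier` and `classify`, the assumed `(num_classes, dim)` layout, and the normalization step are all assumptions, and the file written below holds random stand-in vectors rather than real CLIP embeddings.

```python
import os
import tempfile

import numpy as np


def load_frozen_classifier(path):
    """Load text embeddings from a .npy file and L2-normalize each row.

    Assumes the array has shape (num_classes, dim); this layout is a guess.
    """
    weights = np.load(path).astype(np.float32)
    norms = np.linalg.norm(weights, axis=1, keepdims=True)
    return weights / np.clip(norms, 1e-12, None)


def classify(features, weights):
    """Return cosine-similarity logits of shape (N, num_classes)."""
    norms = np.linalg.norm(features, axis=1, keepdims=True)
    feats = features / np.clip(norms, 1e-12, None)
    return feats @ weights.T


# Stand-in for a file such as lvis_v1_clip_a+cname.npy:
# random vectors, NOT real CLIP embeddings.
rng = np.random.default_rng(0)
fake_embeddings = rng.normal(size=(5, 8)).astype(np.float32)

with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "fake_clip_embeddings.npy")
    np.save(path, fake_embeddings)
    weights = load_frozen_classifier(path)

# A feature that points in the same direction as class 3's embedding
# should get its highest score for class 3.
features = fake_embeddings[3:4] * 2.0
logits = classify(features, weights)
predicted = int(np.argmax(logits, axis=1)[0])
```

Because the weights are only loaded and normalized, swapping in embeddings for a different vocabulary would change the label set without any retraining, which is the usual motivation for this kind of design.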