Hi!!
It's contrastive fine-tuning: we use the same task CLIP was trained on, with all weights unfrozen.
Let me know if you need more details!
So when you say "same task CLIP was trained on", am I right to assume you continued training without adding a classifier?
Yup, we keep the same contrastive pre-training objective
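For anyone reading along, here is a rough sketch of what a CLIP-style contrastive objective looks like in PyTorch. It is not taken from the FashionCLIP training code: the function name `clip_contrastive_loss` and the fixed `logit_scale` value are assumptions made for illustration (CLIP itself learns the temperature).

```python
import torch
import torch.nn.functional as F


def clip_contrastive_loss(image_emb, text_emb, logit_scale=100.0):
    """Symmetric cross-entropy over image-text similarities (CLIP-style).

    Hypothetical helper for illustration; not the authors' implementation.
    """
    # L2-normalise so the dot product is a cosine similarity.
    image_emb = F.normalize(image_emb, dim=-1)
    text_emb = F.normalize(text_emb, dim=-1)

    # (batch, batch) matrix; entry [i, j] compares image i with caption j.
    logits_per_image = logit_scale * image_emb @ text_emb.t()
    logits_per_text = logits_per_image.t()

    # The matching image-caption pairs sit on the diagonal.
    targets = torch.arange(image_emb.size(0), device=image_emb.device)

    loss_images = F.cross_entropy(logits_per_image, targets)
    loss_texts = F.cross_entropy(logits_per_text, targets)
    return (loss_images + loss_texts) / 2
```

No classifier head is involved: the loss only needs the image and text embeddings for a batch of pairs. In this sketch, "all unfrozen" would correspond to leaving `requires_grad=True` on the parameters of both encoders so that the loss updates every weight.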
Thank you for the clarification and the super quick reply :)
Happy to help!!
Hey there,
I am wondering how you did the fine-tuning here. You do not describe it in the paper.
Did you
I don't think you did 2 or 3 since you used full sentences as captions.
How did you do it?
All the best