Open paihuai00 opened 1 year ago
When applying CQL to GEN-VLKT, we use CLIP weight to initialize the category queries. I will upload the code for this part after finishing a recent deadline :smiley:
When applying CQL to GEN-VLKT, we use CLIP weight to initialize the category queries. I will upload the code for this part after finishing a recent deadline 😃
How's it going?
Thank you for your work. It has been three months. Can you release the code for GENVLKT+CQL now? I'm having trouble with its performance.
@Charles-Xie Hello, I would like to inquire whether the CLIP weight used to initialize the category queries is derived from the HOI triplet text or just from interaction text, and how? What is the coefficient for the image loss in GEN? Could you update some code examples?
When applying the CQL method to GEN-VLKT, how should we handle the CLIP weight for verb classification?
Can you provide the relevant code?