Closed lchen1019 closed 5 months ago
The code is outdated. Please stay tuned for updates!
Hi, we just updated the code. The line in question now reads "transformer" instead of "visual", so it takes effect on both transformer encoders of CLIP (image and text). Please let us know if you have further questions about the code!
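For readers wondering why that one-word change matters, here is a minimal sketch. It assumes parameters are selected by substring match on their names, and it uses names laid out like those in OpenAI's CLIP implementation. The helper `select_by_keyword` and the name list are illustrative only and are not taken from this repository.

```python
# Illustrative parameter names, assumed to follow OpenAI CLIP's layout:
# the image encoder lives under "visual.", the text encoder's blocks
# live under a top-level "transformer.".
param_names = [
    "visual.conv1.weight",
    "visual.transformer.resblocks.0.attn.in_proj_weight",
    "transformer.resblocks.0.attn.in_proj_weight",
    "token_embedding.weight",
]


def select_by_keyword(names, keyword):
    """Hypothetical helper: keep the names that contain the keyword."""
    return [name for name in names if keyword in name]


# Matching on "visual" only reaches the image encoder.
visual_only = select_by_keyword(param_names, "visual")

# Matching on "transformer" reaches the transformer blocks of both encoders.
both_encoders = select_by_keyword(param_names, "transformer")

print(visual_only)
print(both_encoders)
```

Under these assumptions, the "visual" filter never touches the text encoder, while the "transformer" filter picks up the residual blocks of both encoders.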
That's great! Thank you for your excellent work.
The framework image shows the text encoder being fine-tuned, but the code does not fine-tune it. Is the figure drawn incorrectly?