Thank you so much for the fantastic code!
Could you please share the center-frame features of the snippets, extracted with the CLIP image encoder, if it is convenient for you?
@gdg452
We have published the code for our new work: https://github.com/UARK-AICV/VLTinT
In the repo, there are more details about how to extract features. I hope this is helpful to you.