muzairkhattak / ViFi-CLIP

[CVPR 2023] Official repository of paper titled "Fine-tuned CLIP models are efficient video learners".
https://muzairkhattak.github.io/ViFi-CLIP/
MIT License
248 stars 18 forks source link

Inference code #4

Closed LilyTheBear closed 10 months ago

LilyTheBear commented 1 year ago

Would you mind to add a sample inference code ?

I wrote one it had way to many issues with half vs float tensors. I took care of it but then it had many other issues. All fixed but now the tensor sizes dont match.

Running the eval code based on main.py and the command line you suggested results into many issues with apax. Is there an easy way to just run it on a sample video with 16 frames and a text sentence?

tomer196 commented 1 year ago

Anyone can help with this kind of script?

muzairkhattak commented 1 year ago

Hi, sorry for the delayed reply.

We will share a sample script / jupyter notebook in upcoming days.

Thank you!

muzairkhattak commented 11 months ago

Hi @LilyTheBear, @tomer196, @odh1995, @dahlian00 and @ooza,

We have released a Jupyter Notebook containing simple code to perform ViFi-CLIP inference on custom video. You can directly find it at this link..

The good news is that it does not require any installation of Apex libaray.

Let me know in case there are any further queries!

Thank you and kind regards!