acherstyx / CoCap

[ICCV 2023] Accurate and Fast Compressed Video Captioning
https://arxiv.org/abs/2309.12867
MIT License
33 stars 4 forks source link

Pretrained checkpoint #3

Open eunseo-kim97 opened 9 months ago

eunseo-kim97 commented 9 months ago

Thanks for your work! Could you upload the model's pretrained checkpoint file? I want to test with the weights file to caption video input.

Thank you

acherstyx commented 9 months ago

Thanks for your work! Could you upload the model's pretrained checkpoint file? I want to test with the weights file to caption video input.

Thank you

You can download them from here:

jqsun98 commented 3 months ago

Except for pretrained model based on CLIP ViT-B/16, could you please upload the model's pretrained checkpoint file based on ViT-L/14? It achieves much competitive results on MSR-VTT and MSVD benchmarks.

Thank you