dhg-wei / DeCap

ICLR 2023 DeCap: Decoding CLIP Latents for Zero-shot Captioning
127 stars 7 forks source link

Pretrained models on video caption #7

Open jqsun98 opened 1 year ago

jqsun98 commented 1 year ago

Congrats! It's a nice work for zero-shot captioning. In the paper, zero-shot video captioning results on MSR-VTT, Activity-Net, etc. have been reported. But from the this repo, I couldn't find codes and pretraine models to perform such repreductions. I'd like to know whether these models and instructions on video caption will be relelased.

Thanks a lot!

tian1327 commented 1 year ago

Great work! It would be great if the pre-trained models along with inference code can be released.