-
Can you provide a demo of captioning, which is to input a folder containing a large number of videos and output the caption of the video?
-
## Project Request
Video Captioning with Deep Learning
The Video Captioning with Deep Learning project focuses on developing a model that automatically generates descriptive captions for videos.…
-
Hi,
I'm deeply inspired by your great work!
Could you please provide some information on the data used to evaluate the detailed captioning ability of the model (not the evaluation script, but th…
-
Hi! Thanks for your great work! I am curious about how to get multi-view object images in the "Object Caption" step of your annotation pipeline. It seems that only a 3D point cloud and object bounding…
-
Hello,
I appreciate your excellent work and have a question regarding the testing process, specifically on how to ensure proper testing without falling into the trap of overfitting.
We conducted…
-
![ms_1021](https://cloud.githubusercontent.com/assets/4707540/19177221/10440040-8c15-11e6-85c9-73c0fbe1d358.png)
-
![ms_1023](https://cloud.githubusercontent.com/assets/4707540/19177230/1e91cc18-8c15-11e6-9dd6-211e4fb46a16.png)
-
How might EasyAnimate slice a 1080p video? Or more specifically what is the frame interval of which the slicing happens? Assuming this is the memory requirements for resolutions lower than 1080p.
E…
-
Hi,
Impressive work! I want to ask how to extract features from my own video-text datasets for finetuning model?
-
Congrats! It's a nice work for zero-shot captioning.
In the paper, zero-shot video captioning results on MSR-VTT, Activity-Net, etc. have been reported. But from the this repo, I couldn't find codes…