-
I run demo on the activitynet caption dataset v__8Zk9dfBgPg.mp4. The results seems to be poor:
```
0000.0 - 0029.0 seconds, A woman introduces the art of stenciling, explaining its technique and …
-
Hi,
I tried to perform inference on my own videos by simply putting those videos in the /visualization/videos folder, then running the provided scripts in this repo.
However, when loading the mod…
-
## Project Request
Video Captioning with Deep Learning
The Video Captioning with Deep Learning project focuses on developing a model that automatically generates descriptive captions for videos.…
-
Hello,I'm currently testing your code on the activityNet dataset, specifically the vc task. During the run, I noticed that the only evaluation metrics provided are meteor, cider, and bleu. However, I …
-
Thank you for your excellent work. However, the data download links for frames or raw videos on the official ActivityNet-Captions website are no longer working. Could you please provide the video data…
-
大哥,我有一个问题,您在测试text-to-video方向的任务的时候,Rank值得计算是建立在多个事件的基础上,还是单个事件得基础上呀。你在训练的时候使用的多个事件文本作为查询,您测试的指标rank也是建立在多事件的基础上吗,跪求大哥指点
![d9a63575dd8af3522d8b2b2931d3657](https://github.com/user-attachments/assets/…
-
Hi @gyxxyg ,
I am running an evaluation code for ActivityNet for 3.7K validation videos, however, the inference time is quite large. I wonder if you can share the inference time and any suggestions…
-
It seems a nice work. I wanted to test it on custom input videos. It would be very helpful if you can provide a script for generating video captions for a raw input video.
-
Thanks for your impressive paper.
In the paper, you said "In this stage, we select a subset from ActivityNet Captions [12] and DiDeMo [1] datasets" in stage 3.
However, I thought that the modal mig…
-
Do you have any training techniques? I have conducted three training experiments and only achieved half of the official accuracy rate