-
The HowTo100M + VidChapters-7M + ViTT model is performing poorly on dense video captioning.
Reproduction:
Run
```
yt-dlp -P $TRANSFORMERS_CACHE -o video.mp4 https://www.youtube.com/watch?v=WJ…
-
I read in the README that PaliGemma can caption a short video. Can anyone guide me on how to do that?
Does it extract every frame of the video? Or does the PaliGemma tokenizer directly support video…
-
Hey @Ino-Ichan
Thanks so much for your work!
Does GIT-LLM support video as input, as the original GIT2 does?
-
The Lightning talks usually consist of multiple talks by different speakers about different topics. I think we should split up the lightning talks video into multiple individual "talks" on RubyVideo. …
-
Thanks for the awesome work!
I'm interested in video captioning. Could you share the captioning checkpoint?
Thanks a lot
-
Hi there,
I'm looking for information regarding the format of the RAW ANC data:
--anc captions.raw
If I want to embed 608/708 closed captioning as ANC data with no video or audio, is this possi…
-
Hi, thanks for your great work!
I'm checking out the newly released model InternVideo2, and it's interesting!
I saw the demo.ipynb files in the multi_modality folder, which can calculate text probabilities.
I'm wondering if …
-
@iory could you add a video captioning node?
fyi: @a-ichikura
-
### design-system-website
### Expected Behavior
A viewer should be able to turn on and turn off the closed captioning while the video is playing.
### Actual Behavior
The closed capti…
-
Hello, thank you for your work. I would like to ask why you think the task of synchronized subtitles is important. How can it help in action generation and action understanding?