-
The HowTo100M + VidChapters-7M + ViTT model is performing poorly on dense video captioning.
Reproduction:
Run:
```
yt-dlp -P $TRANSFORMERS_CACHE -o video.mp4 https://www.youtube.com/watch?v=WJ…
```
-
Hello! Could you please add SALMONN series models?
Title | Venue | Date | Code | Demo
-- | -- | -- | -- | --
[SALMONN: Towards Generic Hearing Abilities for Large Language Models](https://arxiv.o…
-
Great work!
How can I perform the task of generating video captions?
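In case it's useful while waiting for an answer: most video-captioning pipelines start from a uniform frame-sampling step before calling the model. The helper below is a generic sketch, not code from this repo, and the model call itself would come from the repo's own inference scripts.

```python
# A generic helper for the frame-sampling step that most video-captioning
# pipelines start from. Nothing here is specific to this repo; the model
# call itself would come from the repo's own inference scripts.

def uniform_frame_indices(num_frames: int, num_samples: int) -> list[int]:
    """Pick `num_samples` frame indices spread evenly across the video."""
    if num_samples >= num_frames:
        return list(range(num_frames))
    step = num_frames / num_samples
    # Take the midpoint of each of the `num_samples` equal segments.
    return [int(step * i + step / 2) for i in range(num_samples)]

# e.g. for a 100-frame clip, sample 8 frames, decode them with a video
# reader of choice, and pass them to the captioning model's forward pass.
indices = uniform_frame_indices(100, 8)
```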
-
Thank you for sharing this amazing work. We are interested in trying the semantics your trained models extract from videos related to action classification. However, we are not sure if this is po…
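For context on what this usually looks like: a common way to use extracted semantics for action classification is a linear probe on top of pooled per-frame features. The sketch below is generic, with random arrays standing in for the model's actual outputs; the 512-dim feature size and 10-class setup are assumptions.

```python
import numpy as np

# Random arrays stand in for the features the trained model would extract;
# the 512-dim feature size and 10 action classes are assumptions.

def pool_and_classify(frame_features, weights, bias):
    """Mean-pool per-frame features over time, then apply a linear probe."""
    clip_feature = frame_features.mean(axis=0)   # (D,) clip-level feature
    return clip_feature @ weights + bias         # (num_classes,) logits

rng = np.random.default_rng(0)
features = rng.standard_normal((16, 512))  # 16 frames x 512-dim features
W = rng.standard_normal((512, 10))         # linear probe for 10 classes
b = np.zeros(10)
logits = pool_and_classify(features, W, b)
```

The probe's weights would normally be trained on labeled clips; the point of the sketch is just that frozen extracted features plus a small classifier is often enough for this kind of transfer.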
-
I am getting an empty string ("") as output and can't figure out the issue.
```
import torch
from videollava.conversation import conv_templates, SeparatorStyle
from videollava.model.builder import load_pre…
```
-
### My actions before raising this issue
- [x] Read/searched [the docs](https://github.com/opencv/cvat/tree/master#documentation)
- [x] Searched [past issues](/issues)
Feature request: recent…
-
Hi, thanks for your great work!
I'm checking out the newly released InternVideo2 model; it's interesting!
I saw the demo.ipynb file in the multi_modality folder; it can calculate text probabilities.
I'm wondering if …
-
I am currently working on improving video transcriptions using the OpenAI API and have successfully integrated a solution that enhances transcription accuracy. However, I believe that extending the fu…
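As one possible direction for such an extension: long transcripts usually need to be split into chunks before being sent to a correction prompt. The helper below is a generic sketch of sentence-boundary chunking; the 2000-character limit is an arbitrary assumption, and the downstream correction call (e.g. sending each chunk to a chat model with a cleanup prompt) is deliberately left out.

```python
# A generic sentence-boundary chunker; the 2000-character limit is an
# arbitrary assumption, and the downstream correction call (e.g. sending
# each chunk to a chat model with a cleanup prompt) is left out.

def chunk_transcript(text: str, max_chars: int = 2000) -> list[str]:
    """Split a transcript into chunks on sentence boundaries, each under max_chars."""
    sentences = text.replace("\n", " ").split(". ")
    chunks, current = [], ""
    for s in sentences:
        piece = s if s.endswith(".") else s + "."
        # Start a new chunk if adding this sentence would exceed the limit.
        if current and len(current) + len(piece) + 1 > max_chars:
            chunks.append(current.strip())
            current = ""
        current += piece + " "
    if current.strip():
        chunks.append(current.strip())
    return chunks
```

A single sentence longer than the limit will still form an oversized chunk; a production version would need a fallback split for that case.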
-
I ran the captioning script:
`python inference.py --video-list inputs/video_list.txt --prompt-list inputs/prompt_list.txt`
and encountered the following issue:
`/root/anaconda3/envs/panda70m_captio…
-
### Proposal summary
## Feature Request
Enable Opik to display additional media formats, including audio, PDF, and video.
## Background
Opik currently supports only image display, which li…