video-language-understanding Search Results

1000+ results
for video-language-understanding

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

c3lang/c3-web #38

Planning docs design - Different flows suit different kinds …

### As a visitor I Want to have a quick look at see if it is worth looking further - Does the language match my philosophy? - Does the language match my priorities? - Does the language solve my p…

joshring updated 1 day ago
2
Vision-CAIR/MiniGPT-4 #214

Video-LLaMA: An Instruction-Finetuned Visual Language Model…

Hi, motivated by the awesome MiniGPT4, we are excited to present Video-LLaMA (https://github.com/DAMO-NLP-SG/Video-LLaMA), a modular video-language pre-training framework that empowers instruction-fol…

hangzhang-nlp updated 1 year ago
2
DAMO-NLP-SG/VideoLLaMA2 #2

The title of the paper behind the link is not that of the li…

link text: [VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs](https://arxiv.org/pdf/2306.02858) Actual title of the paper: Video-LLaMA An Instruction-tuned Audi…

PromptExpert updated 3 months ago
3
swarmauri/swarmauri-sdk #293

[Feature Research]: MiniCPM-v2.5

### Feature Name MiniCPM-v2.5 ### Feature Description Research about MiniCPM-v2.5 ### Research Findings MiniCPM-v2.5 is a Chinese language model developed by the Beijing Academy of Artificial Int…

abdulsamodazeez updated 1 week ago
1
danieljf24/awesome-video-text-retrieval #6

Hello. Introduce a paper

Hello. We'd like to introduce our paper "Query-Dependent Video Representation for Moment Retrieval and Highlight Detection (CVPR 2023 Paper)" regarding cross-modal moment retrieval. Code : https://…

wjun0830 updated 3 weeks ago
1
yunlong10/Awesome-LLMs-for-Video-Understanding #10

Requesting to add a benchmark to this repo - VELOCITI

Hi there! Thanks for the effort to maintain this amazing repository. This is a request to add our recent work on evaluation of Video Models. We propose an evaluation benchmark, _VELOCITI_. Plea…

varungupta31 updated 2 months ago
1
rd20karim/M2T-Segmentation #3

why the task is important

Hello, thank you for your work. I would like to ask why you think the task of synchronized subtitles is important. How can it help in action generation and action understanding?

xiaoxiaostudy updated 1 week ago
1
huggingface/huggingface_hub #2553

[Feature request] Papers API

I can do the following to search for papers: `curl 'https://huggingface.co/api/papers/search?q=attention'` And I get this: >[{"id":"2409.07146","title":"Gated Slot Attention for Efficient Linear…

nbroad1881 updated 22 minutes ago
3
LLaVA-VL/LLaVA-NeXT #3

Request for NExTQA Dataset Evaluation Prompt and More Result…

To my knowledge, the videos in NExTQA dataset are relatively short, with an average video length of 44 seconds, and there is a noted static bias[1] in the ActivityNet QA dataset. Could you present fur…

patrick-tssn updated 4 months ago
1
microsoft/Teams-AdaptiveCards-Mobile #169

[Adaptive Card iOS - ProductVideo.json]: Mute or Unmute butt…

### Target Platforms iOS ### SDK Version Version 1.0 (2.4.0-beta.24.5.30.1) ### Application Name Adaptive Cards ### Problem Description **Test Environment:** Device: iPhone 11 iOS: 17.5.1 Ap…

vagpt updated 3 weeks ago
2

上一页 1...1 2 3 4 5 6 7...100 下一页

1000+ results for video-language-understanding

1000+ results
for video-language-understanding