-
### As a visitor, I want to have a quick look to see if it is worth looking further
- Does the language match my philosophy?
- Does the language match my priorities?
- Does the language solve my p…
-
Hi, motivated by the awesome MiniGPT4, we are excited to present Video-LLaMA (https://github.com/DAMO-NLP-SG/Video-LLaMA), a modular video-language pre-training framework that empowers instruction-fol…
-
link text: [VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs](https://arxiv.org/pdf/2306.02858)
Actual title of the paper: Video-LLaMA An Instruction-tuned Audi…
-
### Feature Name
MiniCPM-v2.5
### Feature Description
Research about MiniCPM-v2.5
### Research Findings
MiniCPM-v2.5 is a Chinese language model developed by the Beijing Academy of Artificial Int…
-
Hello. We'd like to introduce our paper "Query-Dependent Video Representation for Moment Retrieval and Highlight Detection" (CVPR 2023), which addresses cross-modal moment retrieval.
Code: https://…
-
Hi there!
Thanks for the effort to maintain this amazing repository.
This is a request to add our recent work on the evaluation of video models. We propose an evaluation benchmark, _VELOCITI_.
Plea…
-
Hello, thank you for your work. I would like to ask why you think the task of synchronized subtitles is important. How can it help in action generation and action understanding?
-
I can do the following to search for papers: `curl 'https://huggingface.co/api/papers/search?q=attention'`
And I get this:
>[{"id":"2409.07146","title":"Gated Slot Attention for Efficient Linear…
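The same request can be sketched in Python using only the standard library. This is a minimal sketch, not an official client: it assumes the endpoint returns a JSON array of objects with `id` and `title` fields, as the response quoted above suggests. The helper names `build_search_url`, `extract_titles`, and `search_papers` are hypothetical and introduced only for illustration.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint taken from the curl command above.
SEARCH_ENDPOINT = "https://huggingface.co/api/papers/search"


def build_search_url(query: str) -> str:
    """Build the search URL, URL-encoding the query string."""
    return f"{SEARCH_ENDPOINT}?{urlencode({'q': query})}"


def extract_titles(payload: str) -> list[str]:
    """Pull the `title` field out of each record in a JSON array.

    Assumption: the response is a JSON array of objects; records
    without a `title` key are skipped rather than raising.
    """
    records = json.loads(payload)
    return [r["title"] for r in records if isinstance(r, dict) and "title" in r]


def search_papers(query: str, timeout: float = 10.0) -> list[str]:
    """Send the search request and return the matching paper titles."""
    with urlopen(build_search_url(query), timeout=timeout) as resp:
        return extract_titles(resp.read().decode("utf-8"))


if __name__ == "__main__":
    for title in search_papers("attention"):
        print(title)
```

Keeping URL construction and response parsing in separate functions means both can be checked without network access.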
-
To my knowledge, the videos in the NExTQA dataset are relatively short, with an average length of 44 seconds, and there is a noted static bias [1] in the ActivityNet QA dataset. Could you present fur…
-
### Target Platforms
iOS
### SDK Version
Version 1.0 (2.4.0-beta.24.5.30.1)
### Application Name
Adaptive Cards
### Problem Description
**Test Environment:**
Device: iPhone 11
iOS: 17.5.1
Ap…