-
### Checked
- [x] I searched existing ideas and did not find a similar one
- [x] I added a very descriptive title
- [x] I've clearly described the feature request and motivation for it
### Featu…
-
Hello, and first of all, congratulations on your success in the field of video production. I have a question for you. When I run the cogvideo-5b model locally and when I run the same prompt in hug…
-
- [x] basic gemini sample (gemini sample)
- [x] basic vertex sample (gemini sample)
- [x] custom llm sample (gemma2 sample)
- [x] system prompt usage (recipes sample)
- [x] process json from LLM r…
-
More docs:
qwen2-vl: https://github.com/modelscope/ms-swift/blob/main/docs/source/Multi-Modal/qwen2-vl%E6%9C%80%E4%BD%B3%E5%AE%9E%E8%B7%B5.md
qwen1.5: https://github.com/modelscope/ms-swift/blob…
-
Thank you for your excellent work. I have some questions that I hope you can answer. I would like to apply TFVTG to my custom video dataset to test the video temporal grounding function. What sh…
-
I tried to run the Aria video notebook with vllm 0.6.3 but got the following error. Can you check?
# load Aria model & tokenizer with vllm
import os
os.environ["CUDA_VISIBLE_DEVICES"] = "0"
im…
-
### Project Name
Property LLM
### Description
### Motivation
Many London university students struggle to find suitable accommodation, particularly when it comes to meeting the specific requi…
-
Some video LLMs are also based on the LLaVA codebase, e.g., Video-ChatGPT.
-
### Project Name
Quiz Maker
### Description
Quiz Maker is a GenAI tool that uses RAG to generate quizzes on the fly based on uploaded content. It is an ASP.NET web application that utilises Sema…
-
Could you please confirm which video LLM from this list is best suited to long videos: https://huggingface.co/collections/OpenGVLab/internvideo2-6618ccb574bd2f91410df5cd
My guess is [InternVideo2-…