video-language-model Search Results

1000+ results
for video-language-model

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

ollama/ollama #5276

Support for Vision Language Models that can process Videos.

Is it possible to support loading the VLM's like VideoLLama, Chat-UniVi models that can process Videos?

manishkumart updated 1 month ago
1
clembench/clembench-leaderboard #12

add filter on leaderboard

We could add filters to the leaderboard, similar to what we have for the plots. Could be even more complex, and lead to a re-ordering of the leaderboard.. Basically, could use all parameters that we a…

davidschlangen updated 3 days ago
2
m-bain/whisperX #844

OAI Whisper transcribes correctly but whisperx returns `No a…

I'm getting poor transcription results using whisperx, specifically I am sometimes not getting any transcription out of some short videos, whereas OpenAI's official whisper model transcribes them corr…

reasv updated 1 week ago
9
microsoft/RAG_Hack #57

Project: Talk to Your Docs with AI! LangChain & OpenAI in Py…

### Project Name Talk to Your Docs with AI! ### Description # ☕️ Chat with AI (and optionally your document) This Streamlit application allows users to chat with AI and optionally upload documen…

apiasak updated 2 hours ago
1
PKU-YuanGroup/Video-LLaVA #184

ImportError: cannot import name '_expand_mask' from 'transfo…

**scenes：** CLI Inference **command：** CUDA_VISIBLE_DEVICES=0 python3 -m videollava.serve.cli --model-path "/root/Video-LLaVA-7B" --file "/root/videos/8132-207209040_small.mp4" --load-4bit **i…

qiuchen001 updated 3 weeks ago
3
Nixtla/nixtla #455

[Feature Request: Support for Non-Tabular Data (Images, Text…

### Description > I would like to request the extension of the Time-Series Foundation Model to support non-tabular data types, such as RGB images, text, and sound. This would allow the model to handl…

linkedlist771 updated 1 week ago
2
ultralytics/yolov5 #13269

Video inference with YOLOv5 model in python

### Search before asking - [X] I have searched the YOLOv5 [issues](https://github.com/ultralytics/yolov5/issues) and [discussions](https://github.com/ultralytics/yolov5/discussions) and found no si…

mayurkatre18 updated 2 weeks ago
3
openvinotoolkit/openvino #26380

[Performance]: The quantized full-connected network has no s…

### OpenVINO Version 2024.2.0-15519-5c0f38f83f6-releases/2024/2 ### Operating System Ubuntu 22.04 (LTS) ### Device used for inference CPU ### OpenVINO installation PyPi ### Programming Languag…

eekarot updated 3 days ago
1
openvinotoolkit/openvino #26264

[Performance]: inference takes too long on simple tasks

### OpenVINO Version 2021.2.1.0 ### Operating System Windows System ### Device used for inference CPU ### OpenVINO installation Build from source ### Programming Language C++ ### Hardware Ar…

xueyingxin updated 1 week ago
1
NVlabs/VILA #109

Issue with Flash Attention on V100 GPU for Llama-3-VILA1.5-8…

Hi, I am encountering an issue when running inference on the Llama-3-VILA1.5-8B model. The error message I receive is: ```RuntimeError: FlashAttention only supports Ampere GPUs or newer.``` I…

vedernikovphoto updated 1 day ago
8

上一页 1...1 2 3 4 5 6 7...100 下一页

1000+ results for video-language-model

1000+ results
for video-language-model