-
Is it possible to support loading the VLM's like VideoLLama, Chat-UniVi models that can process Videos?
-
We could add filters to the leaderboard, similar to what we have for the plots. Could be even more complex, and lead to a re-ordering of the leaderboard.. Basically, could use all parameters that we a…
-
I'm getting poor transcription results using whisperx, specifically I am sometimes not getting any transcription out of some short videos, whereas OpenAI's official whisper model transcribes them corr…
-
### Project Name
Talk to Your Docs with AI!
### Description
# ☕️ Chat with AI (and optionally your document)
This Streamlit application allows users to chat with AI and optionally upload documen…
-
**scenes:**
CLI Inference
**command:**
CUDA_VISIBLE_DEVICES=0 python3 -m videollava.serve.cli --model-path "/root/Video-LLaVA-7B" --file "/root/videos/8132-207209040_small.mp4" --load-4bit
**i…
-
### Description
> I would like to request the extension of the Time-Series Foundation Model to support non-tabular data types, such as RGB images, text, and sound. This would allow the model to handl…
-
### Search before asking
- [X] I have searched the YOLOv5 [issues](https://github.com/ultralytics/yolov5/issues) and [discussions](https://github.com/ultralytics/yolov5/discussions) and found no si…
-
### OpenVINO Version
2024.2.0-15519-5c0f38f83f6-releases/2024/2
### Operating System
Ubuntu 22.04 (LTS)
### Device used for inference
CPU
### OpenVINO installation
PyPi
### Programming Languag…
-
### OpenVINO Version
2021.2.1.0
### Operating System
Windows System
### Device used for inference
CPU
### OpenVINO installation
Build from source
### Programming Language
C++
### Hardware Ar…
-
Hi,
I am encountering an issue when running inference on the Llama-3-VILA1.5-8B model. The error message I receive is:
```RuntimeError: FlashAttention only supports Ampere GPUs or newer.```
I…