-
### Model description
Lorax's officially supported models do not include any vision model. This is a big gap for a very successful product.
Having Lorax as a critical component in our tech stack without …
-
# URL
- https://arxiv.org/abs/2405.02246
# Affiliations
- Hugo Laurençon, N/A
- Léo Tronchon, N/A
- Matthieu Cord, N/A
- Victor Sanh, N/A
# Abstract
- The growing interest in vision-language…
-
### Feature request
The developments in the robotics community around RT-2 show a lot of potential for VLMs, but the hardware constraints on small developers make it difficult to deploy RT-2 level p…
-
A 2B vision-language model. It does not run on llama.cpp, so I would like to run it with ONNX.
https://huggingface.co/Qwen/Qwen2-VL-2B-Instruct
-
- CREPE: https://openaccess.thecvf.com/content/CVPR2023/papers/Ma_CREPE_Can_Vision-Language_Foundation_Models_Reason_Compositionally_CVPR_2023_paper.pdf, https://github.com/RAIVNLab/CREPE
- ARO https…
-
This should allow the core device (RPi) to send an HTTP API request, then call the appropriate API function.
The API body will contain:
```
type: str,
query: str,
media: optional-image
```
Where:
`ty…
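
The request body described above could be modeled as follows. This is a minimal Python sketch, not the project's implementation: the class name `ApiRequest`, the helpers `build_body` and `parse_body`, the JSON encoding, and the choice to carry `media` as a base64 string are all assumptions, since the field descriptions are cut off in the source.

```python
import base64
import json
from dataclasses import asdict, dataclass
from typing import Optional


@dataclass
class ApiRequest:
    """Hypothetical model of the body: type, query, optional image."""
    type: str
    query: str
    media: Optional[str] = None  # assumption: base64-encoded image bytes


def build_body(type_: str, query: str, image_bytes: Optional[bytes] = None) -> str:
    """Serialize a request to JSON, encoding the image if one is given."""
    media = None
    if image_bytes is not None:
        media = base64.b64encode(image_bytes).decode("ascii")
    return json.dumps(asdict(ApiRequest(type=type_, query=query, media=media)))


def parse_body(raw: str) -> ApiRequest:
    """Parse and validate a JSON body; raises ValueError on bad input."""
    data = json.loads(raw)
    for key in ("type", "query"):
        if not isinstance(data.get(key), str):
            raise ValueError(f"'{key}' must be a string")
    media = data.get("media")
    if media is not None:
        if not isinstance(media, str):
            raise ValueError("'media' must be a base64 string or absent")
        # binascii.Error is a ValueError subclass, so bad base64 also raises ValueError.
        base64.b64decode(media, validate=True)
    return ApiRequest(type=data["type"], query=data["query"], media=media)
```

On the receiving side, the parsed `type` field would then select which API function to call; the set of valid `type` values is not visible in the source, so it is left unvalidated here.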
-
### Have I written custom code (as opposed to using a stock example script provided in MediaPipe)
Yes
### OS Platform and Distribution
Ubuntu 22.04, arm64, Jetpack 6.0, CUDA 12.2
### Progr…
-
### Motivation
Hi friends,
I'm opening this issue as a place to discuss small vision-language models. Please share your thoughts below!
There's recently been great success in research with sm…
-
I realize OpenVINO was originally made for vision models, but I'm interested in using it to fine-tune LLMs. It appears there is support for fine-tuning ViT models but not language models…
-
I am Zhiqiu Lin, a final-year PhD student at Carnegie Mellon University working with Prof. Deva Ramanan. We found your NeurIPS'24 work fascinating!
I wanted to share [NaturalBench](https://arxiv…