language-vision Search Results

1000+ results
for language-vision

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

predibase/lorax #637

Phi 3.5 vision (4B model)

### Model description Lorax's official supported models does not list any vision model. This is a big gap for a very successful product. Having lorax a critical component in our tech stack without …

CheeseAndMeat updated 3 weeks ago
2
AkihikoWatanabe/paper_notes #1434

What matters when building vision-language models?, Hugo Lau…

# URL - https://arxiv.org/abs/2405.02246 # Affiliations - Hugo Laurençon, N/A - Léo Tronchon, N/A - Matthieu Cord, N/A - Victor Sanh, N/A # Abstract - The growing interest in vision-language…

AkihikoWatanabe updated 1 month ago
1
predibase/lorax #179

vision language model support

### Feature request The developments in the robotics community around RT-2 show a lot of potential for VLMs but the hardware constraints for small developers makes it difficult to deploy RT-2 level p…

7uk3y updated 9 months ago
2
axinc-ai/ailia-models #1566

ADD Qwen2 VL

2BのVision Language Model。llama.cppでは動かないので、ONNXで動かしたい。 https://huggingface.co/Qwen/Qwen2-VL-2B-Instruct

kyakuno updated 5 days ago
6
LAION-AI/CLIP_benchmark #97

Add compositionality benchmarks

- CREPE: https://openaccess.thecvf.com/content/CVPR2023/papers/Ma_CREPE_Can_Vision-Language_Foundation_Models_Reason_Compositionally_CVPR_2023_paper.pdf, https://github.com/RAIVNLab/CREPE - ARO https…

mehdidc updated 3 days ago
8
USFCA-CS490-Micron/private-relay #1

Core Device API Endpoint

This should allow the core device (RPi) to send an HTTP API request, then call the appropriate API function. The API body will contain: ``` type: str, query: str, media: optional-image ``` Where: `ty…

abkslm updated 3 weeks ago
1
google-ai-edge/mediapipe #5690

GPU mode (all tasks) fails to initialize on Nvidia Jetson (a…

### Have I written custom code (as opposed to using a stock example script provided in MediaPipe) Yes ### OS Platform and Distribution Ubuntu 22.04, arm64, Jetpack 6.0, CUDA 12.2 ### Progr…

JC3 updated 1 week ago
3
InternLM/lmdeploy #1748

[Feature] Support for compact Vision-Language models

### Motivation Hi friends, I'm opening this issue as a place to discuss small vision-language models, please share your thoughts below! There's recently been great success in research with sm…

vody-am updated 4 months ago
3
openvinotoolkit/training_extensions #4084

LoRA or QLoRA for LLMs?

I realize OpenVINO was originally made for vision models but I'm interested in using OpenVINO for fine-tuning LLMs. It appears there is support to fine-tune for ViT models but not for language models…

epage480 updated 4 days ago
1
xing0047/cca-llava #2

Great work! And sharing our NaturalBench

I am Zhiqiu Lin, a final-year PhD student at Carnegie Mellon University working with Prof. Deva Ramanan. We found your work on NeurIPS'24 fascinating! I wanted to share [NaturalBench](https://arxiv…

linzhiqiu updated 1 week ago
1

上一页 1...1 2 3 4 5 6 7...100 下一页

1000+ results for language-vision

1000+ results
for language-vision