vlm Search Results - Githubissues

1000+ results
for vlm

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

Blaizzy/mlx-vlm #131

SmolVLM-Instruct-4bit does not work in gradio UI

There's an error when uploading an image to chat while running `python3 -m mlx_vlm.chat_ui --model mlx-community/SmolVLM-Instruct-4bit` Error: chat_ui.py", line 32, in chat if len(message.file…

msurguy updated 1 day ago
2
TIGER-AI-Lab/VLM2Vec #14

Why use Precision@1 instead of Recall@K as a metric?

I want to compare the performance differences between VLM-vec, MM-Embed, and UniIR on retrieval task. I just find that data for the retrieval task is the same in both MM-Embed and M-BEIR

saicoco updated 5 hours ago
1
open-compass/VLMEvalKit #605

Error: module 'torch.library' has no attribute 'register_fak…

After I install it, I tried running this demo command. But I get errors: ``` # Demo from vlmeval.config import supported_VLM model = supported_VLM['idefics_9b_instruct']() # Forward Single Imag…

tjasmin111 updated 4 days ago
1
rubato-yeong/comments #5

multimodal/prism/

# [24’ ICML] Prismatic VLMs: Investigating the Design Space of Visually-Conditioned Language Models - Blog by rubatoyeong Find Directions [https://rubato-yeong.github.io/multimodal/prism/](https://…

utterances-bot updated 3 days ago
1
Blaizzy/mlx-vlm #39

Models to port to MLX-VLM

- [x] MiniCPM-Llama3-V-2_5 - [x] Florence 2 - [x] Phi-3-vision - [x] Bunny - [x] Dolphi-vision-72b - [x] Llava Next - [x] Qwen2-VL - [x] Pixtral - [x] Llama-3.2 - [x] Llava Interleave - [x] …

Blaizzy updated 2 days ago
25
Blaizzy/mlx-vlm #80

Which specific models work with this framework?

This is a nice framework to use for image analysis / captioning, etc. Is there a doc somewhere that sets out which models, specifically can be driven through this app/library? When you say "Pixtra…

jrp2014 updated 1 month ago
25
concept-graphs/concept-graphs #74

Questions for running slam/r3d_stream_rerun_realtime_mapping…

Hi, I have troubles running the slam/r3d_stream_rerun_realtime_mapping.py file. I'm using the code from ali-dev branch and I've modified the DemoApp structure so the input depth and rgb images are ob…

percypeng5221 updated 2 weeks ago
2
opea-project/GenAIExamples #561

Update VisualQnA example with Falcon VLM

Update VisualQnA example that uses Falcon VLM. This would require to include Falcon as part of the validation at https://github.com/opea-project/GenAIComps/tree/main/comps/llms. And then create an …

arun-gupta updated 3 weeks ago
7
sangminwoo/awesome-vision-and-language #13

Add a CVPR 2024 paper

Could you add our CVPR 2024 paper about vision-language pertaining, "Iterated Learning Improves Compositionality in Large Vision-Language Models", into this repo? Paper link: https://arxiv.org/abs/…

hellomuffin updated 4 days ago
1
huggingface/trl #2136

[SFT VLM] Add support for Molmo models

### Feature request Extend the `sft_vlm.py` script to support the new Molmo models from AllenAI: https://huggingface.co/collections/allenai/molmo-66f379e6fe3b8ef090a8ca19 Paper: https://arxiv.org/…

lewtun updated 1 month ago
15

上一页 1...2 3 4 5 6 7 8...100 下一页

1000+ results for vlm

1000+ results
for vlm