vision-language-model Search Results

1000+ results
for vision-language-model

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

langgenius/dify #7707

token count is incorrect in vision mode

### Self Checks - [X] This is only for bug report, if you would like to ask a question, please head to [Discussions](https://github.com/langgenius/dify/discussions/categories/general). - [X] I have s…

yuxizhe updated 6 days ago
1
vllm-project/vllm #9069

[Bug]: Issue with Pixtral Model: Unsupported Vision Configur…

### Your current environment Issue with Pixtral Model: Unsupported Vision Configuration in vLLM (AMD Radeon 7900 XTX) I am trying to load the Pixtral model from Hugging Face (specifically, mistr…

matrix1233 updated 4 hours ago
1
fulfulggg/Information-gathering #224

FIDAVL: Vision-Languageモデルを用いた偽画像の検出と帰属

## タイトル: FIDAVL: Vision-Languageモデルを用いた偽画像の検出と帰属 ## リンク: https://arxiv.org/abs/2409.03109 ## 概要: arXiv:2409.03109v1 発表タイプ: 新規概要: 本稿では、Vision-Languageモデルを用いた偽画像の検出と帰属を行うFIDAVL (Fake Image Detectio…

fulfulggg updated 3 weeks ago
2
arakoodev/EdgeChains #92

Gradient-Regulated Meta-Prompt Learning for Generalizable Vi…

https://arxiv.org/abs/2303.06571

sandys updated 9 months ago
3
ollama/ollama #4257

Support for InternVL-Chat-V1.5

https://huggingface.co/OpenGVLab/InternVL-Chat-V1-5 We introduce InternVL 1.5, an open-source multimodal large language model (MLLM) to bridge the capability gap between open-source and proprietary…

wwjCMP updated 1 week ago
5
google-ai-edge/mediapipe #5609

INTERNAL: Service "kGpuService", required by node mediapipe_…

### Have I written custom code (as opposed to using a stock example script provided in MediaPipe) No ### OS Platform and Distribution iOS 16.4 and iOS 16.6 ### MediaPipe Tasks SDK version 0.10.15…

V-m1r updated 4 weeks ago
1
mbzuai-oryx/LLaVA-pp #18

Can use Lora+base model. but for merging Lora+base is error

Lora+base is working good ![image](https://github.com/mbzuai-oryx/LLaVA-pp/assets/15274284/ccec0900-7db0-4729-9ab4-3c5f68e0f304) ![image](https://github.com/mbzuai-oryx/LLaVA-pp/assets/15274284/7d12…

hellangleZ updated 5 months ago
4
clembench/clembench-leaderboard #12

add filter on leaderboard

We could add filters to the leaderboard, similar to what we have for the plots. Could be even more complex, and lead to a re-ordering of the leaderboard.. Basically, could use all parameters that we a…

davidschlangen updated 1 month ago
2
paperswithlove/papers-we-read #14

Mini-Gemini: Mining the Potential of Multi-modality Vision L…

![image](https://github.com/paperswithlove/papers-we-read/assets/100809463/602058a1-017f-4f10-91fc-fab580e54c5b) - 전체+분할 Low Res. Encoder화 High Res. Dual Encoder까지!!! ![image](https://github.com…

blacklleye updated 6 months ago
1
codezakh/LilT #3

Release of pre-trained models

Hi, This is a really valuable work, in particular I really like that you have results for various degrees of finetuning of language and vision encoders. I am interested in evaluating some of y…

rohit-gupta updated 2 weeks ago
5

上一页 1...6 7 8 9 10 11 12...100 下一页

1000+ results for vision-language-model

1000+ results
for vision-language-model