-
### Self Checks
- [X] This is only for bug report, if you would like to ask a question, please head to [Discussions](https://github.com/langgenius/dify/discussions/categories/general).
- [X] I have s…
-
### Your current environment
Issue with Pixtral Model: Unsupported Vision Configuration in vLLM (AMD Radeon 7900 XTX)
I am trying to load the Pixtral model from Hugging Face (specifically, mistr…
-
## タイトル: FIDAVL: Vision-Languageモデルを用いた偽画像の検出と帰属
## リンク: https://arxiv.org/abs/2409.03109
## 概要:
arXiv:2409.03109v1 発表タイプ: 新規
概要: 本稿では、Vision-Languageモデルを用いた偽画像の検出と帰属を行うFIDAVL (Fake Image Detectio…
-
https://arxiv.org/abs/2303.06571
-
https://huggingface.co/OpenGVLab/InternVL-Chat-V1-5
We introduce InternVL 1.5, an open-source multimodal large language model (MLLM) to bridge the capability gap between open-source and proprietary…
-
### Have I written custom code (as opposed to using a stock example script provided in MediaPipe)
No
### OS Platform and Distribution
iOS 16.4 and iOS 16.6
### MediaPipe Tasks SDK version
0.10.15…
V-m1r updated
4 weeks ago
-
Lora+base is working good
![image](https://github.com/mbzuai-oryx/LLaVA-pp/assets/15274284/ccec0900-7db0-4729-9ab4-3c5f68e0f304)
![image](https://github.com/mbzuai-oryx/LLaVA-pp/assets/15274284/7d12…
-
We could add filters to the leaderboard, similar to what we have for the plots. Could be even more complex, and lead to a re-ordering of the leaderboard.. Basically, could use all parameters that we a…
-
![image](https://github.com/paperswithlove/papers-we-read/assets/100809463/602058a1-017f-4f10-91fc-fab580e54c5b)
- 전체+분할 Low Res. Encoder화 High Res. Dual Encoder까지!!!
![image](https://github.com…
-
Hi,
This is a really valuable work, in particular I really like that you have results for various degrees of finetuning of language and vision encoders.
I am interested in evaluating some of y…