-
**FairCLIP: Harnessing Fairness in Vision-Language Learning**
Paper Link: https://arxiv.org/abs/2403.19949
Code Link: https://github.com/Harvard-Ophthalmology-AI-Lab/FairCLIP
another paper on A…
-
LLaVA supports multiple images by default; what if we send the (T, N, D) visual tokens into the LLM without any aggregation?
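As I understand it, "no aggregation" would mean flattening the (T, N, D) frame tokens into one (T*N, D) sequence and prepending it to the text embeddings. A minimal PyTorch sketch of that idea (the shapes and the concatenation point are my own assumptions, not LLaVA's actual code):

```python
import torch

# Hypothetical shapes: T images, N patch tokens per image, D hidden size, L text tokens.
T, N, D, L = 4, 576, 4096, 32
visual_tokens = torch.randn(1, T, N, D)   # output of vision encoder + projector (assumed)
text_embeds = torch.randn(1, L, D)        # embedded text prompt (assumed)

# "No aggregation": flatten the frame axis into the sequence axis, so every
# patch token of every image becomes one LLM input token.
visual_seq = visual_tokens.flatten(1, 2)                   # (1, T*N, D)
llm_inputs = torch.cat([visual_seq, text_embeds], dim=1)   # (1, T*N + L, D)

print(llm_inputs.shape)
```

The obvious cost is that the sequence length grows linearly with T, so context length and attention cost become the bottleneck rather than any information loss from pooling.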
-
The idea of this work is very interesting!
However, I have two questions about the method:
(1) What's the ground truth caption of the image in Fig. 2? Is the word "feather" correct? (I am not sure…
-
### Feature request
Dear CogVLM's authors,
Thank you for your outstanding work on MLLMs.
In the demo, we can only query pictures. Is it possible to make the model process PDF files?
### Mot…
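As a possible interim workaround for the feature requested above, one can rasterize each PDF page to an image and query the model page by page. A minimal sketch assuming the `pdf2image` package and a hypothetical `model.chat(image, query)` interface (the actual CogVLM inference entry point will differ):

```python
from pdf2image import convert_from_path  # requires poppler installed on the system

def ask_about_pdf(pdf_path, prompt, model):
    """Rasterize each PDF page and query the model page by page."""
    pages = convert_from_path(pdf_path, dpi=200)  # list of PIL images
    answers = []
    for i, page in enumerate(pages):
        # `model.chat` is a placeholder for whatever per-image inference
        # function the repo exposes.
        answers.append((i + 1, model.chat(image=page, query=prompt)))
    return answers
```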
-
Thanks for the great effort on this repo! I see you provide zero-shot results for several MLLMs on the ScienceQA-IMG dataset. Could you please add the detailed results (i.e., NAT, SOC, LAN) of the TEST…
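In case it helps, the per-category numbers can be recomputed from per-question predictions, since each ScienceQA item has a `subject` field ("natural science", "social science", "language science") that maps to NAT/SOC/LAN. A rough sketch (the file paths and prediction format are assumptions):

```python
import json
from collections import defaultdict

# Map ScienceQA subject strings to the short category names used in the paper.
SUBJECT_TO_CAT = {
    "natural science": "NAT",
    "social science": "SOC",
    "language science": "LAN",
}

def per_category_accuracy(problems_path, preds_path):
    problems = json.load(open(problems_path))  # {qid: {"subject": ..., "answer": ...}}
    preds = json.load(open(preds_path))        # {qid: predicted answer index} (assumed format)
    hits, totals = defaultdict(int), defaultdict(int)
    for qid, pred in preds.items():
        cat = SUBJECT_TO_CAT[problems[qid]["subject"]]
        totals[cat] += 1
        hits[cat] += int(pred == problems[qid]["answer"])
    return {cat: hits[cat] / totals[cat] for cat in totals}
```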
-
Recently, some MLLMs have adopted hermes2_yi34b as the base language model, such as InternVL and [LLaVA](https://github.com/haotian-liu/LLaVA). Has your team applied it to this project, lik…
-
Hi,
thank you for this great work!
In Table 1 of your paper, an accuracy improvement is reported from adding S2 Scaling to LLaVA. As shown in Figure 1, the channel dimension with S2 Scaling is double…
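For context, my reading of why the channel dimension doubles: S2 runs the same vision encoder on the base-resolution image and on a higher-resolution version processed as crops, merges the large-scale features back to the base token grid, and concatenates the two feature maps along the channel axis. A toy shape sketch of that bookkeeping (not the authors' code; the crop merging is crudely approximated by averaging):

```python
import torch
import torch.nn.functional as F

def s2_like_features(encode, image, base=336):
    """Toy sketch of S2-style multi-scale features; `encode` maps an image to (B, N, D)."""
    # Scale 1: features from the base-resolution image -> (B, N, D)
    feat_lo = encode(F.interpolate(image, size=base))
    # Scale 2: features from a 2x-resolution image, processed as 4 crops and
    # merged/pooled back to the same N tokens (approximated here by averaging).
    big = F.interpolate(image, size=2 * base)
    crops = [big[..., i * base:(i + 1) * base, j * base:(j + 1) * base]
             for i in range(2) for j in range(2)]
    feat_hi = torch.stack([encode(c) for c in crops]).mean(dim=0)
    # Concatenating along the channel axis doubles D, which is why the
    # projector input dimension doubles when S2 is added to LLaVA.
    return torch.cat([feat_lo, feat_hi], dim=-1)   # (B, N, 2*D)
```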
-
Hi, thank you for your implementation.
While reading through your code, a question came up about the 'masked loss'.
Why do you mask out the last part of each loss using this function?
https:…
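If this is the standard causal language-modeling setup, the usual reason is that after shifting, the last position has no next-token label to predict (and prompt tokens are often masked out too). A generic sketch of that pattern, assuming HuggingFace-style `-100` ignore labels (not necessarily this repo's exact code):

```python
import torch
import torch.nn.functional as F

def causal_lm_loss(logits, input_ids):
    """logits: (B, L, V), input_ids: (B, L). Standard shifted next-token loss."""
    # Position t predicts token t+1, so the final position has nothing to predict
    # and is dropped from the loss.
    shift_logits = logits[:, :-1, :]
    shift_labels = input_ids[:, 1:].clone()
    # Any extra masking (e.g. of prompt tokens) would set those labels to -100,
    # which cross_entropy ignores via ignore_index.
    return F.cross_entropy(
        shift_logits.reshape(-1, shift_logits.size(-1)),
        shift_labels.reshape(-1),
        ignore_index=-100,
    )
```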
-
```diff
--- a/cj3.txt
+++ b/cj3.txt
@@ -30598,7 +30598,7 @@ nhytg 䂌
nic 䥒
nif 㢱
nij 㚈
-nimnb 㣇
+smhhb 㣇
njbc 㣀
nkbr 㢠
nkf 㷺
@@ -33021,6 +33021,7 @@ vmfj 㛁
vmfm…
```
-
'{"object":"error","message":"Unknown part type: image","type":"BadRequestError","param":null,"code":400}'