-
All the question prompts are extracted from DocStruct4M's 'multi_grained_text_localization.jsonl', as shown below:
```
[
"Give the bounding box of the text",
"Predict the bounding box of the text",
…
```
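For reference, a minimal sketch of how such prompts could be collected from the JSONL file. The record layout assumed here (a `messages` list with `role`/`content` fields) is an illustration, not the actual DocStruct4M schema; adjust the field names to the real data.

```python
import json

# Collect the distinct question prompts from the JSONL file.
# NOTE: the "messages"/"role"/"content" layout is an assumption
# for illustration; adapt it to the real record structure.
prompts = set()
with open("multi_grained_text_localization.jsonl", "r", encoding="utf-8") as f:
    for line in f:
        record = json.loads(line)
        for message in record.get("messages", []):
            if message.get("role") == "user":
                prompts.add(message["content"])

for prompt in sorted(prompts):
    print(prompt)
```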
-
Hello,
As I was meticulously reading a paper, I found myself confused about the section on 'projectors.'
Background: From what I understand so far, in the case of CLIP ViT Large, despite the com…
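For context, here is a minimal sketch of the kind of projector usually meant in this setting: a small MLP that maps CLIP ViT-L patch features into the LLM embedding space. The two-layer GELU design and the sizes (1024 for CLIP ViT-L/14, 4096 for a typical 7B LLM) are illustrative assumptions, not details taken from the paper.

```python
import torch
import torch.nn as nn

class VisionProjector(nn.Module):
    """Maps vision-encoder patch features to LLM token embeddings.

    Sizes are illustrative: 1024 is the CLIP ViT-L/14 hidden size,
    4096 is a typical 7B-LLM embedding size.
    """
    def __init__(self, vision_dim: int = 1024, llm_dim: int = 4096):
        super().__init__()
        self.proj = nn.Sequential(
            nn.Linear(vision_dim, llm_dim),
            nn.GELU(),
            nn.Linear(llm_dim, llm_dim),
        )

    def forward(self, patch_features: torch.Tensor) -> torch.Tensor:
        # patch_features: (batch, num_patches, vision_dim)
        # returns:        (batch, num_patches, llm_dim)
        return self.proj(patch_features)
```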
-
LLaVA supports multiple images by default. What happens if we send the (T, N, D) tokens into the LLM without any aggregation?
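As I read the question, "without any aggregation" would simply mean flattening the T per-image token sequences along the sequence axis before they are concatenated with the text embeddings. A rough sketch of that reading (the shapes and the flattening step are my assumption, not LLaVA's actual code):

```python
import torch

# T images, N visual tokens each, already projected to the LLM dim D.
T, N, D = 4, 576, 4096
image_tokens = torch.randn(T, N, D)          # (T, N, D)
text_embeds = torch.randn(32, D)             # (L_text, D)

# "No aggregation": flatten all T*N visual tokens into one sequence
# and prepend them to the text tokens, so the LLM sees T*N + L_text tokens.
visual_seq = image_tokens.reshape(T * N, D)  # (T*N, D)
llm_input = torch.cat([visual_seq, text_embeds], dim=0)
print(llm_input.shape)  # torch.Size([2336, 4096])
```

The obvious trade-off is context length: the visual part of the sequence grows linearly with the number of images, which is presumably why some form of aggregation is often considered.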
-
With [ml-ferret](https://github.com/apple/ml-ferret) out, it would be great to include an MLLM example in this repo, namely with ml-ferret or just LLaVA itself. Being LLaMA-based, I think this would …
-
**FairCLIP: Harnessing Fairness in Vision-Language Learning**
Paper Link: https://arxiv.org/abs/2403.19949
Code Link: https://github.com/Harvard-Ophthalmology-AI-Lab/FairCLIP
another paper on A…
-
### Checklist
- [X] I have searched the [existing issues](https://github.com/streamlit/streamlit/issues) for similar issues.
- [X] I added a very descriptive title to this issue.
- [X] I have provide…
-
Dear CogVLM authors,
Thank you for your outstanding work on MLLMs.
Could you share a rough estimate of the time required to fine-tune or train the model?
```
Hardware requirement
Model In…
```
-
Curious whether MLLMs can work on it. I already suspect LLAMA V1.5 can't. I'd suggest checking out more efficient MLLM models like X-LLM.
-
The idea of this work is very interesting!
However, I have two points of confusion about the method:
(1) What is the ground-truth caption of the image in Fig. 2? Is the word "feather" correct? (I am not sure…
-
As the title says: if images are already cut into that many crops during pretraining, won't the training cost be somewhat hard to cover?
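To make the cost concern concrete, a back-of-envelope count of visual tokens per image as a function of crop count. The 576-tokens-per-crop figure assumes a ViT-L/14 encoder at 336×336 (24×24 patches); the crop counts are just examples, not values from any specific recipe.

```python
# Rough visual-token count per image as a function of crop count.
TOKENS_PER_CROP = 576  # ViT-L/14 @ 336x336 -> 24x24 patches (assumed)

for num_crops in (1, 4, 9, 16):
    # one global view plus num_crops local crops
    total = TOKENS_PER_CROP * (1 + num_crops)
    print(f"{num_crops:2d} crops -> {total:5d} visual tokens per image")
```

Since self-attention cost grows roughly quadratically with sequence length, cutting every pretraining image into many crops multiplies the compute accordingly, which is presumably the concern raised above.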