-
The code you provided in render_robot_pyrender.py may have some minor problems, but it executes normally. partnet_label.py, however, has many errors. First of all, the package from handal_label imp…
-
With [ml-ferret](https://github.com/apple/ml-ferret) out, it would be great to include an MLLM example in this repo, namely with ml-ferret or just LLaVA itself. Being LLaMA based, I think this would …
-
Curious whether MLLMs can work on it. I am already supposing LLaVA-1.5 can't. I would suggest checking out more efficient MLLM models like X-LLM.
-
All the question prompts are extracted from DocStruct4M, 'multi_grained_text_localization.jsonl' as below,
```
[
  "Give the bounding box of the text",
  "Predict the bounding box of the text",
  …
]
```
-
Dear CogVLM authors,
Thank you for your outstanding work on MLLMs.
Could you share an estimate of the time required to fine-tune or train the model?
```
Hardware requirement
Model In…
```
-
Hello,
As I was meticulously reading a paper, I found myself confused about the section on 'projectors.'
Background: From what I understand so far, in the case of CLIP ViT Large, despite the com…
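To make the "projector" concrete: in LLaVA-style models it is a small module that maps the vision encoder's patch features into the LLM's embedding space. A minimal NumPy sketch, where the dimensions (1024 for CLIP ViT-L features, 4096 for a 7B-scale LLM, 256 patch tokens) and random weights are illustrative assumptions:

```python
import numpy as np

rng = np.random.default_rng(0)
vision_feats = rng.standard_normal((256, 1024))   # 256 patch tokens from ViT-L
W1 = rng.standard_normal((1024, 4096)) * 0.01     # illustrative weights
W2 = rng.standard_normal((4096, 4096)) * 0.01

def projector(x):
    # Two-layer MLP with a GELU nonlinearity (tanh approximation),
    # mirroring the MLP projector design used in LLaVA-1.5.
    h = x @ W1
    h = 0.5 * h * (1 + np.tanh(np.sqrt(2 / np.pi) * (h + 0.044715 * h**3)))
    return h @ W2

tokens = projector(vision_feats)  # one LLM-space token per image patch
```

The output rows are then concatenated with the text token embeddings before being fed to the LLM.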
-
As the title says: if pretraining already slices each image into that many sub-images, won't the training cost become hard to cover?
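The cost concern can be put in rough numbers with a token-count estimate. All figures below are illustrative assumptions (a 336px ViT-L/14 encoder yields 24×24 = 576 tokens per crop; one global view plus a 2×2 grid of sub-crops):

```python
# Back-of-envelope: vision tokens per image when slicing into sub-crops.
tokens_per_crop = 576   # 336px input, 14px patches -> 24 * 24 tokens
crops = 1 + 4           # one global view + a 2x2 grid (assumed layout)
total = tokens_per_crop * crops  # tokens the LLM must attend over per image
```

Since attention cost grows with sequence length, slicing multiplies the per-image compute roughly by the number of crops.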
-
![image](https://github.com/mini-sora/minisora/assets/8240984/0d4df698-a324-466b-911d-f561160c5a8c)
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
Efficient Large Languag…
-
Do EVA-CLIP-8B and EVA-CLIP-18B support quantization? My device doesn't have such high specifications, and I'm worried I won't be able to run these models; it currently has only a little over …
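For context, the idea behind running such models on limited memory is weight quantization: storing each weight in one byte instead of four. A minimal sketch of symmetric int8 quantization (not EVA-CLIP's actual loading code, just the underlying technique):

```python
import numpy as np

def quantize_int8(w):
    # Symmetric per-tensor quantization: scale so the largest weight maps to 127.
    scale = np.abs(w).max() / 127.0
    q = np.clip(np.round(w / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    # Recover an approximation of the original float weights.
    return q.astype(np.float32) * scale

w = np.array([0.5, -1.0, 0.25, 0.9], dtype=np.float32)
q, s = quantize_int8(w)
w_hat = dequantize(q, s)  # close to w, but stored at 1 byte per weight
```

In practice this cuts weight memory by ~4x versus fp32 (~2x versus fp16) at a small accuracy cost.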
-
I evaluated LLaVA-1.5-7b on the MMVP dataset and found that its accuracy is 60.0%, which is significantly higher than the 24.7% reported in Table 3.
Upon comparing the evaluation code, I discovered t…
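One common source of such gaps is scoring granularity: MMVP groups its questions into pairs, and the reported metric counts a pair correct only when both questions in it are answered correctly, which is much stricter than per-question accuracy. A minimal sketch of the two metrics (the example flags are made up):

```python
def per_question_acc(correct):
    # Fraction of individual questions answered correctly.
    return sum(correct) / len(correct)

def pair_acc(correct):
    # MMVP-style scoring: consecutive questions form a pair, and a pair
    # counts only if BOTH answers are correct. Assumes an even-length list.
    pairs = [correct[i] and correct[i + 1] for i in range(0, len(correct), 2)]
    return sum(pairs) / len(pairs)

flags = [True, False, True, True]  # two pairs of questions
# per-question accuracy: 3/4 = 0.75; pair-level accuracy: 1/2 = 0.5
```

Checking which of the two an evaluation script computes is usually the first thing to compare when reproduced numbers diverge this much.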