dvlab-research MGM issues

dvlab-research / MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Apache License 2.0

3.22k stars, 280 forks

issues

comfyUI: the following error occurs when starting a task

#137 jimmyyu1989 opened 1 week ago
0
article error

#136 TuuSiwei opened 2 months ago
0
Unable to Merge LoRA Weights with Base Model: ValueError: Can't find 'adapter_config.json' at ...

#135 PARSA-MHMDI opened 2 months ago
0
CHAIR Evaluation

#134 itsqyh opened 4 months ago
0
TypeError: MGMllamaForCausalLM.forward() got an unexpected keyword argument 'cache_position'

#133 czm708033 opened 4 months ago
2
ImportError: cannot import name 'packaging' from 'pkg_resources'

#132 LiuRicky closed 4 months ago
1
When using stable-diffusion: AttributeError: 'NoneType' object has no attribute 'tokenize'

#131 ALR-alr opened 4 months ago
1
llama3 result is repeated many times

#130 pennypengpm opened 5 months ago
1
Does MGM support in-context(few-shot) inference?

#129 waltonfuture opened 5 months ago
0
Will there be support for Qwen2?

#128 huxian0402 opened 5 months ago
0
How to access hidden states?

#127 Divyanshsingh1910 opened 5 months ago
1
I get this error: WARNING: tokenization mismatch: 156 vs. 161. (ignored) when I finetune llama3

#126 shidingz opened 5 months ago
1
Error during conversation when launching the worker with multiple GPUs

#125 kimi360 closed 5 months ago
1
Do you meet the error "MGMConfig"?

#124 strawberryrs620 opened 6 months ago
0
loss 0 and grad nan

#123 TuuSiwei closed 6 months ago
3
error in loading

#122 TuuSiwei closed 6 months ago
2
Requirement for pretraining weights of LLaMa-3-8B-Instruct

#121 shiwk23 opened 6 months ago
0
Can you provide a zip of the laion-gpt4v dataset images?

#120 TuuSiwei opened 6 months ago
0
mgm-34b-hd, should have a 'model_type' key in its config.json

#119 chrisx599 opened 6 months ago
2
Inference problem about the demo.

#118 ApolloRay opened 6 months ago
1
The data for alignment and finetuning contains duplicates. Can you please explain why this is happening?

#117 KANGRuipeng opened 6 months ago
1
Missing dataset problem

#116 TuuSiwei closed 6 months ago
1
| ERROR | stderr | RecursionError: maximum recursion depth exceeded in comparison

#115 linyf38 opened 6 months ago
1
May I ask whether the current inference code lacks support for multi-image input?

#114 Angelalilyer opened 6 months ago
1
Error after changing the image input in a multi-turn conversation

#113 pennypengpm opened 6 months ago
0
Generation-related Instructions dataset link

#112 berry-ding opened 6 months ago
1
Multi-node multi-GPU training gives worse results than single-node multi-GPU training

#111 DePengW opened 6 months ago
1
Loss does not decrease

#110 yfthu closed 6 months ago
0
Could you release the code that generates the generation_pure_text data?

#109 pennypengpm closed 6 months ago
4
LLama 70B support

#108 PrateekPal641 opened 6 months ago
0
Inference speed

#107 PrateekPal641 opened 6 months ago
0
lora initialisation missing from builder.py

#106 adrielkuek opened 6 months ago
2
Error while loading model with transformers library

#105 PrateekPal641 closed 6 months ago
3
Congratulations on the best LLaVA-derived models!

#104 deepbeepmeep opened 7 months ago
1
Some weights of the model checkpoint were not used when initializing MGMLlamaForCausalLM

#103 charlesCXK opened 7 months ago
2
How to use the stage2 ckpt for stage3 fine-tuning?

#102 linqinguang opened 7 months ago
1
Are there plans to add DPO training to mitigate model hallucination?

#101 jiezhangGt opened 7 months ago
0
Take input image as condition.

#100 Adenialzz closed 7 months ago
2
How to fix [NETWORK ERROR DUE TO HIGH TRAFFIC. ] on MacOS ?

#99 seasoncool opened 7 months ago
2
stage2 loss is 0

#98 jiezhangGt closed 7 months ago
1
Excessive Length of Responses from Mini Gemini

#97 Dopplenum closed 6 months ago
1
AttributeError: 'OpenCLIPVisionTower' object has no attribute 'device'

#96 l1019008146 opened 7 months ago
2
Use of ocr in Evaluation

#95 bruceisme opened 7 months ago
1
'LlamaForCausalLM' object has no attribute 'get_vision_tower'

#94 HongLouyemeng closed 7 months ago
1
Calling a custom fine-tuned model via the CLI raises 'OpenCLIPVisionTower' object has no attribute 'device'

#93 HongLouyemeng closed 7 months ago
16
pretrain error: lack of preprocessor_config.json

#92 jiezhangGt closed 7 months ago
1
Which deepspeed version is it

#91 Kareneveve opened 7 months ago
2
A network error occurs when I run the inference command, and the inference interface cannot be built

#90 HongLouyemeng closed 7 months ago
2
Why do the llama training scripts use a different conv for pretraining and fine-tuning?

#89 shidingz opened 7 months ago
1
model asks self questions and answers

#88 Bowei-Li opened 7 months ago
2