Open LengSicong opened 5 months ago
Hi! We just use the official LLaVA-1.5 stage-1 checkpoints. The pretraining stage only fine-tunes the (MLP-based) projector, and the LLM weights come from Vicuna-1.5. You can download the projector weights here: https://github.com/haotian-liu/LLaVA/blob/main/docs/MODEL_ZOO.md#projector-weights
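To illustrate what "only the projector is fine-tuned" means in practice, here is a minimal sketch that splits parameters into trainable and frozen sets by name. The parameter names, the sizes, and the `mm_projector` keyword are assumptions made for illustration, not values read from the actual checkpoints.

```python
# Hypothetical parameter names and element counts; a real checkpoint will differ.
PARAMS = {
    "model.embed_tokens.weight": 1000,
    "model.layers.0.self_attn.q_proj.weight": 4000,
    "model.mm_projector.0.weight": 300,
    "model.mm_projector.0.bias": 10,
    "model.mm_projector.2.weight": 300,
    "model.mm_projector.2.bias": 10,
    "lm_head.weight": 1000,
}


def split_trainable(params, keyword="mm_projector"):
    """Split parameters into (trainable, frozen) dicts by a name keyword.

    Assumption: projector parameters are the ones whose name contains
    `keyword`; everything else (the LLM weights) stays frozen.
    """
    trainable = {name: size for name, size in params.items() if keyword in name}
    frozen = {name: size for name, size in params.items() if keyword not in name}
    return trainable, frozen


trainable, frozen = split_trainable(PARAMS)
print("trainable:", sorted(trainable))
print("trainable elements:", sum(trainable.values()), "of", sum(PARAMS.values()))
```

In this toy setup only the four projector entries end up trainable, which is why the stage-1 output can be shared as a small projector-only file on top of the unchanged Vicuna-1.5 weights.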
Hi authors, congrats on this great work!
May I know if you could release the Visual-CoT checkpoints from after the pre-training stage, so that we can continue fine-tuning from those checkpoints?