LLaVA-VL / LLaVA-NeXT

1.01k stars 55 forks source link

Strange Model Loading Issue: Inconsistency with Vision Tower Parameters #82

Open shidingz opened 3 days ago

shidingz commented 3 days ago

I found something strange when loading the model. It seems that you have released the vision_tower during training, but when loading the vision_tower, you did not load the gradient-updating parameters, instead, you loaded the original vision tower (you loaded the model through the name mm_vision_tower). Can you please explain why this is happening?