luogen1996 / LLaVA-HR

LLaVA-HR: High-Resolution Large Language-Vision Assistant
Apache License 2.0
202 stars 9 forks source link

unfreeze vision encoder on SFT #12

Closed syp2ysy closed 4 months ago

syp2ysy commented 4 months ago

Thank you very much for your work, it has been very helpful to me. I just wanted to confirm a small detail with you. During the training (SFT stage), you are training the vision encoder. However, during the evaluation stage, are you still loading the weights of the untrained visual encoder?