thunlp / LLaVA-UHD

LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images
303 stars 15 forks source link

update vit #3

Closed yiyexy closed 5 months ago

yiyexy commented 6 months ago

Thanks for your great work! I can't find the code for updating vit during fine-tuning stage. Could you please provide a reference?

guozonghao96 commented 6 months ago

You can refer to our paper. We do not update ViT in the sft stage.