haotian-liu / LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
https://llava.hliu.cc
Apache License 2.0
19.81k stars 2.18k forks source link

为什么要重新初始化了lm_head #837

Open Felix0805 opened 11 months ago

Felix0805 commented 11 months ago

Question

在llava_llama.py中可以看到lm_head被重新初始化覆盖了llama的lm_head,为何要重新初始化呢,不能复用llama的lm_head,但是embedding复用了llama的embedding,请问有什么实验经验么

Felix0805 commented 11 months ago

好像也是复用了llama的lm_head,为何要在LlavaLlamaForCausalLM类中再定义一下lm_head呢

shipengai commented 10 months ago

+1 same question @haotian-liu hao

421zuoduan commented 6 months ago

same question