Open liboaccn opened 1 month ago
Additional note: I changed the visual encoder from CLIP to SigLIP:
image_processor = dict(
    type=SiglipImageProcessor.from_pretrained,
    pretrained_model_name_or_path=visual_encoder_name_or_path,
    trust_remote_code=True)
model = dict(
    type=LLaVAModel,
    freeze_llm=True,
    freeze_visual_encoder=True,
    llm=dict(
        type=AutoModelForCausalLM.from_pretrained,
        pretrained_model_name_or_path=llm_name_or_path,
        trust_remote_code=True),
    visual_encoder=dict(
        type=SiglipVisionModel.from_pretrained,
        pretrained_model_name_or_path=visual_encoder_name_or_path))
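When fine-tuning llava-llama3-8b, the loss becomes NaN after only a few steps. What could be causing this? I see that other people have reported a similar problem in the GitHub issues, and the official reply was to change the lr. My current setting is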
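
For reference, a minimal sketch of the optimizer section that the lr advice would touch, assuming an mmengine-style config. The concrete values (`lr`, `max_norm`, `dtype`) are hypothetical and not taken from this post, and the idea that the NaN comes from fp16 overflow or exploding gradients is only a guess, not a confirmed diagnosis.

```python
# Hypothetical values for illustration only; none of them come from the post.
lr = 2e-5        # assumed: a lowered learning rate, as the official reply suggests trying
max_norm = 1.0   # assumed: gradient-clipping threshold

optim_wrapper = dict(
    type='AmpOptimWrapper',            # mmengine mixed-precision optimizer wrapper
    optimizer=dict(
        type='AdamW',
        lr=lr,
        betas=(0.9, 0.999),
        weight_decay=0.0),
    # Clip the gradient norm so a single bad batch is less likely to blow up the update.
    clip_grad=dict(max_norm=max_norm, error_if_nonfinite=False),
    loss_scale='dynamic',
    # Assumption: bf16 has a wider exponent range than fp16, so it is less prone to overflow.
    dtype='bfloat16')
```

If the loss still turns NaN with settings like these, the lr alone is probably not the cause, and it would be worth checking whether the visual encoder outputs are already non-finite before they reach the projector.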