How do I use the weights produced by fine-tuning only the vision model, without full or LoRA fine-tuning of the LLM?

OpenBMB / MiniCPM-V

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Apache License 2.0

Open sam880717 opened 1 month ago

sam880717 commented 1 month ago

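When training MiniCPM-Llama3-V 2.5 with the settings below, which fine-tune only the vision model and apply neither full nor LoRA fine-tuning to the LLM, how should the resulting weights be used? --tune_vision true --tune_llm false --use_lora false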

Trying to load the model directly raises the following error: AttributeError: 'MiniCPMVTokenizerFast' object has no attribute 'tokenizer'. Did you mean: '_tokenizer'?

LDLINGLINGLING commented 1 month ago

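Hello, please compare the saved output directory with the original model directory from before fine-tuning and check whether any files are missing. If some are, does copying the missing files into the saved directory resolve the problem?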
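
The comparison suggested above could be scripted roughly as follows. This is only a sketch, not something from the MiniCPM-V repository: the function name `copy_missing_files` and both directory paths are hypothetical. It also assumes that weight files (`.safetensors`, `.bin`, `.pt` and their `.index.json` files) should be skipped, so that the original weights are never copied in next to the fine-tuned ones.

```python
import shutil
from pathlib import Path

# Assumption: files with these endings hold model weights and must not be
# copied from the original checkpoint into the fine-tuned one.
WEIGHT_ENDINGS = (".safetensors", ".bin", ".pt", ".index.json")


def copy_missing_files(base_dir, output_dir):
    """Copy top-level non-weight files that exist in base_dir but not in output_dir.

    Returns the sorted list of file names that were copied.
    """
    base, out = Path(base_dir), Path(output_dir)
    copied = []
    for src in sorted(base.iterdir()):
        if not src.is_file() or src.name.endswith(WEIGHT_ENDINGS):
            continue  # skip sub-directories and weight files
        dst = out / src.name
        if not dst.exists():  # never overwrite files written by the training run
            shutil.copy2(src, dst)
            copied.append(src.name)
    return copied


# Example call (both paths are placeholders, replace them with your own):
# print(copy_missing_files("path/to/original_model", "path/to/finetuned_output"))
```

The returned list shows which files were absent from the fine-tuned directory. This helps confirm whether missing tokenizer or code files are behind the loading error.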