Is training multimodal large models on NPU supported?

modelscope / ms-swift

Use PEFT or Full-parameter to finetune 350+ LLMs or 90+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL, Phi3.5-Vision, ...)

https://swift.readthedocs.io/zh-cn/latest/Instruction/index.html

Apache License 2.0

3.7k stars 316 forks

Is training multimodal large models on NPU supported? #1941

Open ChingKwanCheung opened 1 month ago

ChingKwanCheung commented 1 month ago

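Is training multimodal large models on NPU supported? For example, LoRA or full-parameter fine-tuning of internvl2 or qwen2-vl? Can the best-practice guide for LLMs on NPU at https://github.com/modelscope/ms-swift/blob/main/docs/source/LLM/NPU%E6%8E%A8%E7%90%86%E4%B8%8E%E5%BE%AE%E8%B0%83%E6%9C%80%E4%BD%B3%E5%AE%9E%E8%B7%B5.md be used as a reference?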

tastelikefeet commented 1 month ago

You can give it a try; it should work the same way.

ChingKwanCheung commented 1 month ago

> You can give it a try; it should work the same way.

Is pre-training of multimodal large models on NPU also supported?