alibaba / Pai-Megatron-Patch

The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
Apache License 2.0
674 stars 94 forks source link

qwen moe模型训练脚本的参数是不是不对?能提供正确的训练脚本吗 #234

Closed jianhai0527 closed 4 months ago

jianhai0527 commented 4 months ago
image

RT

jerryli1981 commented 4 months ago

您好,ReadMe已更新,烦请CR:https://github.com/alibaba/Pai-Megatron-Patch/pull/239

image