alibaba / ChatLearn

A flexible and efficient training framework for large-scale alignment tasks
Apache License 2.0

[Feature] Why do MoE models need separate support in the framework? #158

Closed yiyepiaoling0715 closed 1 day ago

yiyepiaoling0715 commented 4 days ago

Is your feature request related to a problem? Please describe. A clear and concise description of what the problem is. Ex. I'm always frustrated when [...]

Describe the solution you'd like A clear and concise description of what you want to happen.

Describe alternatives you've considered A clear and concise description of any alternative solutions or features you've considered.

Additional context Add any other context or screenshots about the feature request here.

Regarding the ChatLearn framework (https://github.com/alibaba/ChatLearn/tree/main): why does MoE need separate support? If a dense model such as Llama is simply swapped out for an MoE model such as DeepSeek-V2 for training, what problems would arise?

adoda commented 2 days ago

If you run a small-scale MoE model that does not need any special parallelism strategy, the case is relatively simple. If you run at a larger scale where training enables expert parallelism (EP) but inference does not, additional support is required.
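To make the EP mismatch concrete, here is a minimal, hypothetical sketch (not ChatLearn's actual API; the function name and arguments are illustrative assumptions). With expert parallelism, each training rank only holds a slice of the experts, so before the weights can be handed to an inference engine that runs without EP, the expert shards have to be gathered and merged back into the full expert list — this resharding/parameter-sync step is the kind of "additional support" referred to above.

```python
# Hypothetical illustration only: gathering expert weights across an
# expert-parallel (EP) group so a non-EP inference engine can load them.
import torch
import torch.distributed as dist


def gather_expert_weights(local_experts, ep_group):
    """Collect the expert weights owned by every EP rank.

    local_experts: list of torch.Tensor, the experts held by this EP rank.
    ep_group:      the expert-parallel process group used during training.
    Returns the full, globally ordered list of expert weights.
    """
    ep_world_size = dist.get_world_size(ep_group)
    gathered = [None] * ep_world_size
    # all_gather_object exchanges each rank's (CPU-copied) expert tensors;
    # a production implementation would all_gather flattened GPU tensors
    # instead, this only shows the resharding step conceptually.
    dist.all_gather_object(
        gathered, [w.detach().cpu() for w in local_experts], group=ep_group
    )
    # Flatten back into the global expert order expected by the checkpoint.
    return [w for rank_experts in gathered for w in rank_experts]
```

Without a step like this (plus the reverse mapping when pushing updated weights back), the training and inference sides disagree on how MoE expert parameters are laid out, which is why larger MoE runs need dedicated support rather than a drop-in model swap.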