Closed WangZY1111 closed 4 months ago
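No merging is needed. A full-parameter fine-tuned model is already the complete model, the same as one produced by normal training.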
Describe the bug
Please provide a clear and concise description of what the bug is. If applicable, add screenshots to help explain your problem, especially for visualization related problems.