Closed sweetboxwwy closed 10 months ago
Please provide a clear and concise description of what the bug is. If applicable, add screenshots to help explain your problem, especially for visualization related problems.
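I tested the inference time of merged models produced by training baichuan2 on different datasets. With `--model_max_length` set to 1024, inference time is normal, but after training with it changed to 512, inference becomes much slower.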
Describe the bug