SkyworkAI / Skywork

Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sourced the model, training data, evaluation data, evaluation methods, etc. 天工系列模型在3.2TB高质量多语言和代码数据上进行预训练。我们开源了模型参数,训练数据,评估数据,评估方法。
Other
1.21k stars 111 forks source link

英文评测中的eval loss #80

Open XXares opened 4 months ago

XXares commented 4 months ago

您好,想问问technical report里面展示的eval loss是用只用gsm8k_test这一个任务做验证loss吗? 然后用这一个的task的eval loss和其他任务的平均task metric做分析?