Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sourced the model, training data, evaluation data, evaluation methods, etc. 天工系列模型在3.2TB高质量多语言和代码数据上进行预训练。我们开源了模型参数,训练数据,评估数据,评估方法。
Other
1.21k
stars
111
forks
source link
ValueError: Trainer: evaluation requires an eval_dataset. #25
在预训练最后一个step时需要评估验证指标,因为没有指定eval data而报错了,请问怎么关掉这个?ValueError: Trainer: evaluation requires an eval_dataset. metrics = self.evaluate(ignore_keys=ignore_keys_for_eval) File "/home/suser/.conda/envs/llm/lib/python3.10/site-packages/transformers/trainer.py", line 3062, in evaluate eval_dataloader = self.get_eval_dataloader(eval_dataset) File "/home/suser/.conda/envs/llm/lib/python3.10/site-packages/transformers/trainer.py", line 888, in get_eval_dataloader raise ValueError("Trainer: evaluation requires an eval_dataset.") ValueError: Trainer: evaluation requires an eval_dataset.