baichuan-inc / Baichuan-7B

A large-scale 7B pretraining language model developed by BaiChuan-Inc.
https://huggingface.co/baichuan-inc/baichuan-7B
Apache License 2.0
5.67k stars 506 forks

[Question] What loss level counts as acceptable for continued pretraining? #115

Open parkLGW opened 1 year ago

parkLGW commented 1 year ago

Questions

I am continuing pretraining on my own data, and the loss has stayed at around 2.1. Is this normal? [attached image]
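
To make the number easier to compare across setups, the loss can be converted to perplexity. The sketch below assumes the reported loss is the mean per-token cross-entropy in nats (natural log), which the post does not state; the function names are hypothetical and not part of the Baichuan-7B codebase. Note that values are only comparable between runs that use the same tokenizer and evaluation data.

```python
import math


def loss_to_perplexity(loss_nats: float) -> float:
    """Convert mean per-token cross-entropy (in nats) to perplexity."""
    return math.exp(loss_nats)


def loss_to_bits_per_token(loss_nats: float) -> float:
    """Convert mean per-token cross-entropy from nats to bits."""
    return loss_nats / math.log(2)


if __name__ == "__main__":
    loss = 2.1  # value reported in the question; assumed to be in nats
    print(f"perplexity:     {loss_to_perplexity(loss):.2f}")
    print(f"bits per token: {loss_to_bits_per_token(loss):.2f}")
```

Under that assumption, a loss of 2.1 corresponds to a perplexity of roughly 8.2, that is, the model is on average about as uncertain as a uniform choice among eight tokens.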
