THUDM / ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model
Apache License 2.0

[Help] About the dev dataset in the fine-tuning script #1395

Open juhengzhe opened 1 year ago

juhengzhe commented 1 year ago

Is there an existing issue for this?

Current Behavior

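The `train.sh` script at https://github.com/THUDM/ChatGLM2-6B/blob/main/ptuning/train.sh specifies two datasets: `--train_file AdvertiseGen/train.json` and `--validation_file AdvertiseGen/dev.json`.

1. Is the validation set (`--validation_file`) used when running with `do_train`?
2. How can the training log report the model's loss on both the training set and the validation set?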

Expected Behavior

No response

Steps To Reproduce

bash train.sh

Environment

- OS:centos7
- Python:3.9
- Transformers:4.27
- PyTorch:2.0.1
- CUDA Support (`python -c "import torch; print(torch.cuda.is_available())"`) :11.7

Anything else?

No response