Closed ZhuYanzhen1 closed 5 months ago
warmup_steps 替换成 warmup_ratio [TrainingArguments] warmup_ratio (float, optional, defaults to 0.0) — Ratio of total training steps used for a linear warmup from 0 to learning_rate. warmup_steps (int, optional, defaults to 0) — Number of steps used for a linear warmup from 0 to learning_rate. Overrides any effect of warmup_ratio.
已解决,谢谢
Reminder
Reproduction
我使用命令
./train.sh
发起对LLAMA3-70B的全参数训练,我使用的显卡是3张 A100-SXM4-40GB,以下是train.sh的内容。以下是llama3_sft_multi.yaml的内容,其中
model_name_or_path
一项我设置为了本地的模型。该模型是从Meta官网下载的LLAMA3-Instruct模型的pth文件经由transformers脚本转换后得到的:以下是
deepspeed_z3_config.json
的内容:运行
./train.sh
后报以下错误:Expected behavior
使用三张显卡进行LLAMA3-70B的全参量训练
System Info
transformers
version: 4.42.0.dev0Others
No response