s-JoL / Open-Llama

Complete training code for an open-source, high-performance Llama model, covering the full pipeline from pre-training to RLHF.
https://huggingface.co/s-JoL/Open-Llama-V2
MIT License

Can instruction fine-tuning (instruct finetune) run on a 32 GB V100? #61

Closed · honglianglv closed this 1 year ago

honglianglv commented 1 year ago

Can instruction fine-tuning (instruct finetune) run on a 32 GB V100? I currently have four V100s with 32 GB of memory each. Is there any way to run instruction fine-tuning on these four cards? I tried stage 2 and still ran out of GPU memory.
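As rough context for why plain ZeRO stage 2 overflows a 32 GB card here: with bf16 weights/gradients and fp32 Adam states, a 7B model carries roughly 112 GB of model states before any activations. A back-of-envelope sketch (illustrative arithmetic only; activations, buffers, and fragmentation are ignored):

```python
# Rough per-GPU memory estimate for a 7B model under ZeRO-2,
# with and without CPU optimizer offload. Not a measurement.
params = 7e9
gpus = 4

weights = params * 2          # bf16 weights, replicated on every GPU under ZeRO-2
grads = params * 2            # bf16 gradients, sharded across GPUs by ZeRO-2
optim = params * (4 + 4 + 4)  # Adam: fp32 master weights + momentum + variance, sharded

per_gpu = weights + (grads + optim) / gpus
print(f"ZeRO-2, optimizer on GPU:   ~{per_gpu / 2**30:.1f} GiB per GPU")

per_gpu_offload = weights + grads / gpus  # optimizer states moved to CPU RAM
print(f"ZeRO-2 + optimizer offload: ~{per_gpu_offload / 2**30:.1f} GiB per GPU")
```

By this estimate, keeping the optimizer on GPU needs roughly 36 GiB per card, which already exceeds 32 GB, while offloading the optimizer to CPU drops model states to around 16 GiB per card.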

I also tried the ds_stage3 config file and hit the error below. Does anyone know what causes it? Many thanks. Launch command:

```
accelerate launch --config_file configs/accelerate_configs/ds_stage3.yaml train_lm.py --train_config configs/instruct_config.yaml --model_config configs/model_configs/7B.json
```

The error:

```
File "/home/fenbi/miniconda3/envs/mc-model/lib/python3.9/site-packages/transformers/models/open_llama/modeling_open_llama.py", line 385, in _init_weights
    module.weight.data[module.padding_idx].zero_()
IndexError: index 32000 is out of bounds for dimension 0 with size 0
```
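A plausible explanation (an assumption based on the traceback, not confirmed by the maintainers): with ZeRO stage 3 and `zero3_init_flag: true`, parameters are partitioned as the model is constructed, so `module.weight.data` is an empty size-0 shard on most ranks when `_init_weights` tries to zero the padding-token row at index 32000. Under ZeRO-3, such a write has to gather the parameter first, roughly like this sketch:

```python
# Sketch of guarding weight init under ZeRO-3 (names follow the traceback;
# the surrounding integration in transformers differs in detail).
import torch
import deepspeed

def init_padding_row(module):
    # Gather the full (unpartitioned) embedding weight; modifier_rank=0 means
    # rank 0's in-place edits are broadcast back to all shards on exit.
    with deepspeed.zero.GatheredParameters(module.weight, modifier_rank=0):
        if torch.distributed.get_rank() == 0:
            module.weight.data[module.padding_idx].zero_()
```

Setting `zero3_init_flag: false`, or staying on stage 2 as in the next comment, avoids partitioned initialization altogether.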

honglianglv commented 1 year ago

With the stage 2 configuration I got it running after a small tweak (only two lines of the config were modified):

```yaml
compute_environment: LOCAL_MACHINE
deepspeed_config:
  deepspeed_multinode_launcher: standard
  gradient_clipping: 1.0
  offload_optimizer_device: cpu
  offload_param_device: none
  zero3_init_flag: false
  zero_stage: 2
distributed_type: DEEPSPEED
fsdp_config: {}
machine_rank: 0
main_training_function: main
mixed_precision: bf16
num_machines: 1
num_processes: 3
rdzv_backend: static
same_network: true
use_cpu: cpu  # note: accelerate normally expects a boolean here (e.g. false)
```
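For anyone sizing a similar setup: DeepSpeed ships memory estimators that predict per-GPU and per-CPU model-state memory for each stage/offload combination before launching. A minimal sketch, assuming the checkpoint from the repo link above (any causal LM checkpoint works; loading a 7B model on CPU needs time and ample host RAM):

```python
# Estimate ZeRO-2 model-state memory (weights, gradients, optimizer states)
# for 4 GPUs on 1 node; prints a table covering offload on/off.
from transformers import AutoModelForCausalLM
from deepspeed.runtime.zero.stage_1_and_2 import estimate_zero2_model_states_mem_needs_all_live

model = AutoModelForCausalLM.from_pretrained("s-JoL/Open-Llama-V2")
estimate_zero2_model_states_mem_needs_all_live(model, num_gpus_per_node=4, num_nodes=1)
```

The stage 3 counterpart, `estimate_zero3_model_states_mem_needs_all_live` in `deepspeed.runtime.zero.stage3`, works the same way.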