shibing624 / MedicalGPT

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. Trains medical large language models, implementing incremental pretraining (PT), supervised fine-tuning (SFT), RLHF, DPO, and ORPO.
Apache License 2.0
2.94k stars 451 forks

Problem when pretraining with qwen: Cannot copy out of meta tensor; no data! #346

Open cove1011 opened 3 months ago

cove1011 commented 3 months ago

Traceback (most recent call last):
  File "F:\xiazai\MedicalGPT-main\pretraining.py", line 781, in <module>
    main()
  File "F:\xiazai\MedicalGPT-main\pretraining.py", line 722, in main
    trainer = SavePeftModelTrainer(
  File "C:\Users\admin.conda\envs\newrlhf\lib\site-packages\transformers\trainer.py", line 489, in __init__
    self._move_model_to_device(model, args.device)
  File "C:\Users\admin.conda\envs\newrlhf\lib\site-packages\transformers\trainer.py", line 730, in _move_model_to_device
    model = model.to(device)
  File "C:\Users\admin.conda\envs\newrlhf\lib\site-packages\torch\nn\modules\module.py", line 1152, in to
    return self._apply(convert)
  File "C:\Users\admin.conda\envs\newrlhf\lib\site-packages\torch\nn\modules\module.py", line 802, in _apply
    module._apply(fn)
  File "C:\Users\admin.conda\envs\newrlhf\lib\site-packages\torch\nn\modules\module.py", line 802, in _apply
    module._apply(fn)
  File "C:\Users\admin.conda\envs\newrlhf\lib\site-packages\torch\nn\modules\module.py", line 802, in _apply
    module._apply(fn)
  [Previous line repeated 5 more times]
  File "C:\Users\admin.conda\envs\newrlhf\lib\site-packages\torch\nn\modules\module.py", line 825, in _apply
    param_applied = fn(param)
  File "C:\Users\admin.conda\envs\newrlhf\lib\site-packages\torch\nn\modules\module.py", line 1150, in convert
    return t.to(device, dtype if t.is_floating_point() or t.is_complex() else None, non_blocking)
NotImplementedError: Cannot copy out of meta tensor; no data!

shibing624 commented 3 months ago

Not enough GPU memory. Quantize the model, or switch to a machine with more GPU memory.
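
For a rough sense of why quantization can help here, the sketch below estimates the memory needed just to hold the model weights at different precisions. The parameter count (7B) and GPU size (8 GiB) are hypothetical placeholders, not values from this issue, and the helper names `weight_memory_gib` and `fits` are made up for illustration. The estimate ignores activations, gradients, and optimizer state, so real training needs noticeably more than this lower bound.

```python
# Rough lower bound on the memory needed to hold model weights.
# All concrete numbers here are illustrative assumptions, not from the issue.

# Approximate storage cost per parameter for common precisions.
BYTES_PER_PARAM = {"fp32": 4.0, "fp16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_memory_gib(num_params: float, dtype: str) -> float:
    """Return the GiB needed to store `num_params` weights in `dtype`."""
    return num_params * BYTES_PER_PARAM[dtype] / 1024**3


def fits(num_params: float, dtype: str, vram_gib: float) -> bool:
    """True if the weights alone fit in `vram_gib`.

    Ignores activations, gradients, and optimizer state.
    """
    return weight_memory_gib(num_params, dtype) <= vram_gib


if __name__ == "__main__":
    params = 7e9  # hypothetical 7B-parameter model
    vram = 8.0    # hypothetical 8 GiB GPU
    for dtype in BYTES_PER_PARAM:
        size = weight_memory_gib(params, dtype)
        print(f"{dtype}: {size:.1f} GiB, fits={fits(params, dtype, vram)}")
```

If the weights do not fit at the chosen precision, some of them may end up offloaded rather than loaded onto the GPU, which is one common way to hit the "meta tensor" error when the Trainer later calls `model.to(device)`.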