yuanzhoulvpi2017 / zero_nlp

中文nlp解决方案(大模型、数据、模型、训练、推理)
MIT License
2.81k stars 351 forks source link

Segment Fault 是哪的问题? #165

Open wanghaosjtu opened 8 months ago

wanghaosjtu commented 8 months ago

在一个没有GPU但是有500G内存的机器上run zero_nlp/chatglm_v2_6b_lora/train.sh, 这是哪出问题了?

[INFO|configuration_utils.py:768] 2024-01-02 16:39:58,209 >> Generate config GenerationConfig {
"_from_model_config": true,
"eos_token_id": 2,
"pad_token_id": 0,
"transformers_version": "4.33.2"
}

/usr2/haow/workspace/miniconda3/envs/ptune/lib/python3.8/site-packages/torch/utils/checkpoint.py:429: UserWarning: torch.utils.checkpoint: please pass in use_reentrant=True or use_reentrant=False explicitly. The default value of use_reentrant will be updated to be False in the future. To maintain current behavior, pass use_reentrant=True. It is recommended that you use use_reentrant=False. Refer to docs for more details on the differences between the two variants. warnings.warn( free(): invalid pointerfree(): invalid pointerfree(): invalid pointerfree(): invalid pointerfree(): invalid pointerfree(): invalid pointerfree(): invalid pointerfree(): invalid pointerfree(): invalid pointerfree(): invalid pointerfree(): invalid pointerfree(): invalid pointer free(): invalid pointer./train.sh: line 26: 19064 Segmentation fault python main.py