OpenBMB / BMTrain

Efficient Training (including pre-training and fine-tuning) for Big Models
Apache License 2.0

training script is stuck in bmt.init_distributed(seed=0) #72

Closed: greenteaofwhu closed this issue 1 year ago

greenteaofwhu commented 1 year ago

GPU: V100
torch version: 1.10.0+cu113
Python: 3.7.13
bmtrain: 0.2.1
(two screenshots attached)

a710128 commented 1 year ago

Have you tried calling init_distributed before get_args?
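
A minimal sketch of that ordering, in case it helps. The original script was not posted as text, so `get_args` here is a hypothetical stand-in built on `argparse`, and `--batch-size` is an illustrative flag only. The one BMTrain call used is `bmt.init_distributed(seed=0)`, taken from the issue title. This is not a confirmed fix, just the suggested call order written out.

```python
import argparse


def get_args(argv=None):
    # Hypothetical stand-in for the training script's own argument parser.
    parser = argparse.ArgumentParser()
    parser.add_argument("--batch-size", type=int, default=8)  # illustrative flag
    return parser.parse_args(argv)


def main(argv=None):
    # Imported lazily so this module can be loaded on a machine without BMTrain.
    import bmtrain as bmt

    # Initialize the distributed backend first ...
    bmt.init_distributed(seed=0)
    # ... and only then parse the script's arguments.
    args = get_args(argv)
    return args
```

The script entry point would call `main()` and be started by the same distributed launcher as before, so that every rank reaches `init_distributed`.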

wyl7 commented 1 year ago

I have the same problem.

wyl7 commented 1 year ago

Hello greenteaofwhu 😁, did you solve this issue?

MayDomine commented 1 year ago

Please follow the instructions in the bug issue template and provide all the details needed to reproduce your error.
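
One way to gather those details as text rather than screenshots is a small standard-library script like the sketch below. The helper names (`package_version`, `collect_report`) are made up for illustration. The environment variables listed are the ones `torchrun` sets for each worker; whether they are the cause of this particular hang is an assumption, not something established in this thread.

```python
import os
import platform

# Variables that torchrun sets for each worker process.
LAUNCHER_VARS = ("RANK", "LOCAL_RANK", "WORLD_SIZE", "MASTER_ADDR", "MASTER_PORT")


def package_version(name):
    # Look up the installed version without importing the package itself.
    try:
        from importlib.metadata import version  # available on Python 3.8+
        return version(name)
    except Exception:
        return "unknown or not installed"


def collect_report(env=None):
    # Build a dict of the details a bug report usually needs.
    env = os.environ if env is None else env
    report = {
        "python": platform.python_version(),
        "platform": platform.platform(),
        "torch": package_version("torch"),
        "bmtrain": package_version("bmtrain"),
    }
    for var in LAUNCHER_VARS:
        report[var] = env.get(var, "<unset>")
    return report


if __name__ == "__main__":
    for key, value in collect_report().items():
        print(f"{key}: {value}")
```

Running it under the same launch command as the training script shows what each rank sees. On Python 3.7 the package versions will show as unknown, because `importlib.metadata` is not available there.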