jackroos / VL-BERT

Code for ICLR 2020 paper "VL-BERT: Pre-training of Generic Visual-Linguistic Representations".
MIT License
735 stars, 110 forks

The problem of multitasking parallel processing #68

Open Pter61 opened 3 years ago

Pter61 commented 3 years ago

While I was training on 2 GPUs (V100 16G), this error occurred in the middle of the second epoch:

...
File ".../python3.6/site-packages/torch/autograd/__init__.py", line 93, in backward
allow_unreachable=True) # allow_unreachable flag
RuntimeError: element 0 of tensors does not require grad and does not have a grad_fn

However, training continues without problems on a single GPU. Could you please help me deal with this problem?
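
For readers unfamiliar with this message, here is a minimal sketch of how PyTorch raises it in general: `backward()` is called on a tensor that is not connected to anything that requires gradients. It only illustrates the error itself and is not a diagnosis of the multi-GPU failure reported above. The function names `reproduce_grad_error` and `is_backward_safe` are hypothetical, and the code returns early if `torch` is not installed.

```python
def reproduce_grad_error():
    """Return the RuntimeError message raised by backward() on a detached loss.

    Returns None if torch is not installed (assumption: torch may be missing).
    """
    try:
        import torch
    except ImportError:
        return None

    # A "loss" built only from tensors with requires_grad=False has no grad_fn,
    # so autograd has nothing to differentiate through.
    loss = torch.ones(3).sum()
    try:
        loss.backward()
    except RuntimeError as err:
        return str(err)
    return ""


def is_backward_safe(loss):
    """Hypothetical guard: True only if the loss is attached to the autograd graph."""
    return bool(getattr(loss, "requires_grad", False))


if __name__ == "__main__":
    print(reproduce_grad_error())
```

A guard such as `is_backward_safe(loss)` before `loss.backward()` can help locate the batch or process in which the loss loses its graph, though it does not explain why that happens.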

alice-cool commented 3 years ago

Could you help me? The build fails because the file named in an #include cannot be found.

jackroos commented 3 years ago

@Pter61 Which config are you running, and what is the full command you ran? Please also provide your PyTorch and CUDA versions.
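
One possible way to gather the requested version information is sketched below. The helper name `collect_env_info` and the dictionary keys are assumptions made for illustration; only standard `platform` and `torch` attributes are used, and the function still works (reporting `torch` as missing) when PyTorch is not installed.

```python
import platform


def collect_env_info():
    """Collect Python, PyTorch, and CUDA version details for a bug report."""
    info = {"python": platform.python_version()}
    try:
        import torch
    except ImportError:
        info["torch"] = None  # PyTorch is not installed in this environment
        return info

    info["torch"] = torch.__version__
    info["cuda_build"] = torch.version.cuda  # None for CPU-only builds
    info["cuda_available"] = torch.cuda.is_available()
    info["gpu_count"] = torch.cuda.device_count()
    return info


if __name__ == "__main__":
    for key, value in collect_env_info().items():
        print(f"{key}: {value}")
```

PyTorch also ships its own report script, runnable as `python -m torch.utils.collect_env`, which prints a more complete summary.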

alice-cool commented 3 years ago

I just followed the Environment section of the README: conda create -n..., conda install..., git clone (apex), but it throws an exception at: pip install Cython, pip install -r requirements.txt, ./scripts/init.sh (this step has errors; it does not recognize #include <torch/extension.h>).

Thanks for your attention.
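
As a hedged aside, one way to check whether the installed PyTorch actually ships `torch/extension.h` is to search the include directories that PyTorch reports for C++ extensions. The thread does not say how the problem was eventually solved, so this sketch is only a diagnostic and not the fix. The helper name `find_torch_extension_header` is hypothetical; `include_paths` is the function from `torch.utils.cpp_extension`.

```python
import os


def find_torch_extension_header():
    """Return the path to torch/extension.h, or None if it cannot be found.

    Also returns None when torch (or its cpp_extension module) is unavailable.
    """
    try:
        from torch.utils.cpp_extension import include_paths
    except ImportError:
        return None

    # include_paths() lists the header directories used to build C++ extensions.
    for path in include_paths():
        candidate = os.path.join(path, "torch", "extension.h")
        if os.path.isfile(candidate):
            return candidate
    return None


if __name__ == "__main__":
    print(find_torch_extension_header())
```

If this prints `None` even though PyTorch imports correctly, the installed version may predate the header or the installation may be incomplete; both are guesses rather than something the thread confirms.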

---Original--- From: Weijie Su, Date: Wed, Dec 30, 2020 11:53 AM, Subject: Re: [jackroos/VL-BERT] The problem of multitasking parallel processing (#68)

@Pter61 What config do you run? And what is the full command you ran?

alice-cool commented 3 years ago

I have solved the problem. Thanks