Closed ShaniGam closed 7 years ago
Hi. Yes, I works for me for pytorch 0.2.0. What do you mean by mp doesn't work?
all the training processes exit after:
(policy_loss + args.value_loss_coef * value_loss).backward()
in some cases I get a core file that says:
Program terminated with signal 11, Segmentation fault.
but it doesn't say where it happens exactly
I did manage to run other python files using pytorch 0.2
Interesting. How many RAM do you have?
252 GB it worked perfectly with pytorch 1.2 (but I have to upgrade)
Does it happen immediately or after several hours of training?
What arguments do you use?
Immediately, default arguments.
Do you use python2 or python3?
At the moment, I cannot reproduce the issue. I need more information.
python 2.7 I deleted all the torch files and installing it from scratch now, It might be a conflict between the versions (even though it does work on other files)
This code works only for python 3. I'm not planning to add support for python 2 any time soon (it's already scheduled to be deprecated anyway).
Oh, so that's why! btw, thank you for the code, it's been very useful in my research :)
Have you tried running it with pytorch 0.2.0? seem like the multiprocessing doesn't work.