Open yinweisu opened 2 years ago
Hi @yinweisu, thanks for reporting. I believe this is a valid issue; I was able to reproduce it on my setup as well. After a bit of digging, it seems this is a known issue with ptl 1.5: https://github.com/PyTorchLightning/pytorch-lightning/discussions/11435 https://github.com/PyTorchLightning/pytorch-lightning/issues/12327
The solution is basically to upgrade to ptl 1.6. @amogkam, this is another data point that we should do the upgrade sooner rather than later.
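For anyone who wants to confirm which side of the 1.5/1.6 boundary their environment is on, here is a minimal sketch using only the standard library. The helper names (`parse_major_minor`, `needs_upgrade`, `installed_ptl_version`) are made up for illustration and are not part of ray_lightning or PyTorch Lightning; the sketch also assumes the package is installed under the distribution name `pytorch-lightning`.

```python
import re
from importlib import metadata


def parse_major_minor(version: str) -> tuple:
    """Extract (major, minor) from a version string such as '1.5.10'."""
    match = re.match(r"(\d+)\.(\d+)", version)
    if match is None:
        raise ValueError(f"unrecognized version string: {version!r}")
    return int(match.group(1)), int(match.group(2))


def needs_upgrade(version: str, minimum: tuple = (1, 6)) -> bool:
    """Return True if `version` is older than `minimum` (default: 1.6)."""
    return parse_major_minor(version) < minimum


def installed_ptl_version():
    """Return the installed pytorch-lightning version, or None if absent."""
    try:
        # Assumption: the distribution is installed as "pytorch-lightning".
        return metadata.version("pytorch-lightning")
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_ptl_version()
    if found is None:
        print("pytorch-lightning is not installed")
    elif needs_upgrade(found):
        print(f"pytorch-lightning {found} is older than 1.6")
    else:
        print(f"pytorch-lightning {found} is 1.6 or newer")
```

This only reports the installed version; whether ray_lightning itself supports ptl 1.6 in a given release is a separate question that the check does not answer.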
Thanks! And yes, upgrading to ptl 1.6 soon would be awesome!
+1 for upgrading to PyTorch Lightning 1.6! Is there an estimate for when that work might occur?
When using PBT/PB2, I received the following error:
This issue happens after a trial is paused and resumed. I was able to reproduce it with some modifications to the example provided by ray lightning:
The args I passed in:
python3 test_ray_lightning.py --use-gpu --num-workers 2 --num-samples 4
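The script itself is not reproduced here. As a rough sketch of how the three flags in that command could be parsed, assuming `--use-gpu` is a boolean switch and the other two take integers (the defaults and help strings below are placeholders, not taken from the actual script):

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser for the flags used in the command above."""
    parser = argparse.ArgumentParser(description="PBT/PB2 reproduction (sketch)")
    # Assumption: --use-gpu is an on/off switch with no value.
    parser.add_argument("--use-gpu", action="store_true",
                        help="run training workers on GPU")
    # Assumption: defaults of 1 are placeholders.
    parser.add_argument("--num-workers", type=int, default=1,
                        help="number of training workers per trial")
    parser.add_argument("--num-samples", type=int, default=1,
                        help="number of trials to sample")
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args()
    print(args)
```

With the arguments shown above, this would yield `use_gpu=True`, `num_workers=2`, and `num_samples=4`.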
Versions: