Closed chamorajg closed 3 years ago
Hi, I'm a bot from the Ray team :)
To help human contributors to focus on more relevant issues, I will automatically add the stale label to issues that have had no activity for more than 4 months.
If there is no further activity in the 14 days, the issue will be closed!
You can always ask for help on our discussion forum or Ray's public slack channel.
While restoring the checkpoint trainer has a same episode reward mean when it was starting afresh.
Ray version (0.8.5) and other system information (Python 3.6, Pytorch, Ubuntu-16.04): The episode reward after restoring and after 1st iteration both are same. While saving checkpoint the trainer gave a positive episode reward mean.
The weights aren't the same after restoring and before restoring but the episode reward is not similar to a pretrained model. @ @