nikhilbarhate99 / PPO-PyTorch

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
MIT License

policy.eval() after load_state_dict() #46

Closed. xinqin23 closed this issue 2 years ago

xinqin23 commented 3 years ago

Dear Barhate,

Hi! Thank you very much for sharing the code! I can reproduce your results and they look super cool!

While playing around with the code, may I ask about PPO.py, line 83? After

self.policy_old.load_state_dict(self.policy.state_dict())

do we need to add

self.policy_old.eval()

Would this extra line make any difference, perhaps something you have observed in the past? (A toy sketch of the pattern I mean is below.)
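To make the question concrete, here is a small self-contained sketch of the pattern. The networks are just placeholders, not the repo's actual ActorCritic class:

```python
import torch.nn as nn

# Toy stand-ins for self.policy and self.policy_old (placeholders only,
# not the repo's ActorCritic class).
policy = nn.Sequential(nn.Linear(8, 16), nn.Tanh(), nn.Linear(16, 2))
policy_old = nn.Sequential(nn.Linear(8, 16), nn.Tanh(), nn.Linear(16, 2))

# The existing step in PPO.py: copy the current weights into the old policy.
policy_old.load_state_dict(policy.state_dict())

# The extra call I am asking about: switch the copy to evaluation mode.
policy_old.eval()
```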

I am asking because of this page in the PyTorch tutorials: https://pytorch.org/tutorials/beginner/saving_loading_models.html

It says we should call eval():

"Remember that you must call model.eval() to set dropout and batch normalization layers to evaluation mode before running inference. Failing to do this will yield inconsistent inference results."

I haven't had a chance to fully test the difference yet, so I am asking in case you have already encountered something similar.

Thank you for reading! Best Regards, Xin

nikhilbarhate99 commented 2 years ago

"Remember that you must call model.eval() to set dropout and batch normalization layers to evaluation mode before running inference. Failing to do this will yield inconsistent inference results."

Since we are not using any dropout or batch normalization layers, it does not matter if we call model.eval() or not.
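A quick sketch to illustrate this (toy networks, not the actual actor-critic used here): eval() only changes the behaviour of layers such as Dropout and BatchNorm, so a plain Linear/Tanh network produces identical outputs in both modes.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
x = torch.randn(4, 8)

# Plain Linear/Tanh network (no dropout or batchnorm): train() vs eval()
# makes no difference to the forward pass.
plain = nn.Sequential(nn.Linear(8, 16), nn.Tanh(), nn.Linear(16, 2))
plain.train()
out_train = plain(x)
plain.eval()
out_eval = plain(x)
print(torch.allclose(out_train, out_eval))  # True

# Network with a Dropout layer: the two modes differ, so eval() would matter.
dropped = nn.Sequential(nn.Linear(8, 16), nn.Dropout(p=0.5), nn.Linear(16, 2))
dropped.train()
out_train_d = dropped(x)
dropped.eval()
out_eval_d = dropped(x)
print(torch.allclose(out_train_d, out_eval_d))  # almost always False
```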