hill-a / stable-baselines

A fork of OpenAI Baselines, implementations of reinforcement learning algorithms
http://stable-baselines.readthedocs.io/
MIT License
4.16k stars 725 forks source link

Resnet Observation Policy Cannot Reload?#question#bug #1056

Closed yiwc closed 3 years ago

yiwc commented 3 years ago

Hi Stable Baselines Team,

Great work in developing this package, thanks a lot! However, I have met a problem.

I have a customized policy with the resnet v2 as the observation module. It shows a good convergence in the training. However, when I saved this module after training (for example, return=40, task success), and reload it in the evaluation, it shows bad performance (return = 10, no okey).

I suspect this module is not well saved? or if you have a recommend way to integrate with resnet observation network?

Later, I will give you a minimum code to reproduce this issue.

If you have any idea what's could lead to this problem, pls share your opinion. Thanks!

Miffyli commented 3 years ago

Things that pop into mind without further details: