Resnet Observation Policy Cannot Reload?#question#bug

hill-a / stable-baselines

A fork of OpenAI Baselines, implementations of reinforcement learning algorithms

MIT License

4.16k stars 725 forks source link

Hi Stable Baselines Team,

Great work in developing this package, thanks a lot! However, I have met a problem.

I have a customized policy with the resnet v2 as the observation module. It shows a good convergence in the training. However, when I saved this module after training (for example, return=40, task success), and reload it in the evaluation, it shows bad performance (return = 10, no okey).

I suspect this module is not well saved? or if you have a recommend way to integrate with resnet observation network?

Later, I will give you a minimum code to reproduce this issue.

If you have any idea what's could lead to this problem, pls share your opinion. Thanks!

hill-a / stable-baselines

Resnet Observation Policy Cannot Reload?#question#bug #1056