DLR-RM / stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
https://stable-baselines3.readthedocs.io
MIT License
8.85k stars 1.68k forks

A successfully trained PPO agent needs some steps of re-training to make good predictions #2009

Closed tanielsfranklin closed 2 weeks ago

tanielsfranklin commented 2 weeks ago

🐛 Bug

A successfully trained agent needs some steps of re-training to make good predictions.

I'm using a custom env. I trained the agent until it reached a 100% success rate. At the end of training the model is saved and then evaluated for 10 episodes, with good results.

When I load the saved model, it performs poorly.

I wrote a single script that uses the same parameters as in training. After loading the model, the evaluation is very bad. I then re-train it for 1000 steps (1 or 100 steps is not enough), and after that re-training it performs well. I also tried evaluating on another eval_env, but the result was still unsuccessful.

image

The terminal output after running the script:

image

Relevant log output / Error message

No response

tanielsfranklin commented 2 weeks ago

I found a mistake in my model loader script. My bad.