A succesfully trained agent demands some steps of re-training to make good predictions
I'm using a custom env.
I've trained it until reach 100% of success. In the end of training it is saved it is evaluated by 10 episodes with good results
When I try to load it performs poorly.
I wrote a single script to use the same parameters used for training.
After load the model the evaluation is very bad
I re-train it for 1000 steps (1 or 100 doesn't work)
After re-train it performs good
I tried to evaluate other eval_env but the result was unsuccessful
🐛 Bug
A succesfully trained agent demands some steps of re-training to make good predictions
I'm using a custom env. I've trained it until reach 100% of success. In the end of training it is saved it is evaluated by 10 episodes with good results
When I try to load it performs poorly.
I wrote a single script to use the same parameters used for training. After load the model the evaluation is very bad I re-train it for 1000 steps (1 or 100 doesn't work) After re-train it performs good I tried to evaluate other eval_env but the result was unsuccessful
The terminal output after run the script
Relevant log output / Error message
No response
System Info
Checklist