As picture shows, result is long way from 456 that RL Baselines Zoo got to. I have used more hyperparameters, but scores are always much lower.
What I'm aware of that can have impact on this issue is seed, as I didn't pick the same. Nevertheless I have tried many instances of A2C and the problem remains.
Checklist
[X] I have checked that there is no similar issue in the repo
❓ Question
Hello, I first optimize A2C on 1mln steps using RL Baselines3 Zoo:
Firstly i have changed
a2c.yml
in RL Baselines3 Zoo to work with RAM version of Seaquest:Then wrote command:
Top 3 results: Then using for example these hyperparameters: and using this code:
I get results:
As picture shows, result is long way from 456 that RL Baselines Zoo got to. I have used more hyperparameters, but scores are always much lower. What I'm aware of that can have impact on this issue is seed, as I didn't pick the same. Nevertheless I have tried many instances of A2C and the problem remains.
Checklist