Closed modanesh closed 3 years ago
The performance of the already provided model was around 194.60 +/- 79.90 on 10 test episodes. I trained a new agent which has a much better performance on 10 test episodes: 2240.81 +/- 6.36.
I used the rl baseline3 zoo code for training.
Hello,
Thanks =) The performance was indeed far away from what it can be.
The performance of the already provided model was around 194.60 +/- 79.90 on 10 test episodes. I trained a new agent which has a much better performance on 10 test episodes: 2240.81 +/- 6.36.
I used the rl baseline3 zoo code for training.