sebascuri / qreps


Fix cartpole always using 0 as seed. #3

Closed sebimarkgraf closed 3 years ago

sebimarkgraf commented 3 years ago

The seed in the cartpole env is currently hard-coded to 0, so it ignores the seeds used when launching all experiments. With other seeds, the current performance is not very good.

This can be fixed by optimizing the hyperparameters. My quick-and-dirty optimization arrived at eta=1.0, alpha=0.1 and lr=0.02, which seem to be more in line with the rest of the environments.
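To illustrate the idea, here is a minimal sketch of passing the seed through a config instead of hard-coding 0. `ExperimentConfig`, `ToyCartPole`, and `make_env` are hypothetical names used only for this example, not the actual qreps code, and the toy environment only models seeded resets. The default hyperparameters are the values quoted above.

```python
import random
from dataclasses import dataclass


@dataclass
class ExperimentConfig:
    # Hypothetical config object; defaults are the values quoted above.
    seed: int = 0
    eta: float = 1.0
    alpha: float = 0.1
    lr: float = 0.02


class ToyCartPole:
    """Stand-in for the real environment: it only models seeded resets."""

    def __init__(self):
        self._rng = random.Random()

    def seed(self, seed):
        # Re-seed the environment's private random number generator.
        self._rng.seed(seed)

    def reset(self):
        # CartPole-style initial state: four values drawn from [-0.05, 0.05].
        return [self._rng.uniform(-0.05, 0.05) for _ in range(4)]


def make_env(config):
    env = ToyCartPole()
    # Use the configured seed rather than a hard-coded 0.
    env.seed(config.seed)
    return env


if __name__ == "__main__":
    for seed in (0, 1, 2):
        state = make_env(ExperimentConfig(seed=seed)).reset()
        print(seed, state)
```

With this shape, each launched experiment gets its own initial states, while runs that share a seed stay reproducible.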

If you want to discuss any of this in-depth, I would be very happy to hear from you!

sebascuri commented 3 years ago

Thanks!!!