Open TrentBrick opened 4 years ago
seed = 0 failed to converge for me
Hi Treton,
Good observation, and happy that someone even tried to reproduce it. No, unfortunately I didn't try different seeds since my main goal was to understand and successfully implement the paper testing it on a different environment. I honestly didn't play much with hyperparameters and as you can see I didn't run the algorithm with other seeds for robustness. So there is plenty of room for improvement.
I'd be super happy to see more robust implementations.
Thanks a lot, Fran
On Tue, Jul 14, 2020 at 1:40 AM Trenton Bricken notifications@github.com wrote:
Thanks for sharing your code and reproduction!
I am finding that everything runs great when I use the seed=0 you provided but not with other random seeds. Were you also finding your results to not be robust?
Thanks. Trenton
— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub https://github.com/jscriptcoder/Upside-Down-Reinforcement-Learning/issues/1, or unsubscribe https://github.com/notifications/unsubscribe-auth/AAEV2XHAHIMPMQ34ZZZ5ZDTR3OLNZANCNFSM4OY67Y3A .
Thanks for sharing your code and reproduction!
I am finding that everything runs great when I use the seed=0 you provided but not with other random seeds. Were you also finding your results to not be robust?
Thanks. Trenton