declanoller / RWG_benchmarking

Analyzing Reinforcement Learning Benchmarks with Random Weight Guessing
MIT License
5 stars 0 forks source link

Simple variations: softmax outputs, no nonlinearities, etc. #2

Closed declanoller closed 5 years ago

declanoller commented 5 years ago

There are several slight variations that it would make sense to test. For example, now, for discrete action spaces, I just use an argmax across the outputs. It's possible that a softmax would be more effective for some env's.

Similarly, we have a nonlinearity right now, but it's possible that's not necessary for some env's (see winning agents for LunarLander-v2 and CartPole-v0 here: https://www.declanoller.com/2019/01/25/beating-openai-games-with-neuroevolution-agents-pretty-neat/ ; completely linear).

More broadly: make it so many variations can be tested for each.

declanoller commented 5 years ago

Added with PR #6 , closing.