joschu / modular_rl

Implementation of TRPO and related algorithms
MIT License
622 stars 156 forks source link

Will dropout break out the final loss of ppo algorithm? #26

Open ppaanngggg opened 7 years ago

ppaanngggg commented 7 years ago

If I add dropout layer to model, will it be a bad idea?

Any experiments there?

ppaanngggg commented 7 years ago

I use eval model when explore environment, and use train model for policy, old policy and value model when training