RicardoDominguez / PyCREPS

Contextual Relative Entropy Policy Search for Reinforcement Learning in Python

CartPole solved condition #12

RicardoDominguez closed this issue 6 years ago

RicardoDominguez commented 6 years ago

OpenAI Gym considers the CartPole problem solved when the average reward over 100 consecutive episodes is >= 195 (link).

PyCREPS currently treats the problem as solved only when the reward over 100 episodes is >= 200.
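
For illustration, the difference between the two thresholds could be checked with something like the following. This is only a rough sketch, not the code in this repository: it assumes the per-episode total rewards are collected in a plain list, and `is_solved`, `threshold`, and `window` are hypothetical names that belong to neither PyCREPS nor Gym.

```python
GYM_THRESHOLD = 195.0  # Gym's CartPole criterion mentioned above
WINDOW = 100           # number of most recent episodes to average over


def is_solved(episode_rewards, threshold=GYM_THRESHOLD, window=WINDOW):
    """Return True if the mean of the last `window` episode rewards is >= threshold.

    Hypothetical helper: `episode_rewards` is assumed to be a list of
    total rewards, one entry per finished episode, oldest first.
    """
    if len(episode_rewards) < window:
        # Not enough episodes yet to evaluate the condition.
        return False
    recent = episode_rewards[-window:]
    return sum(recent) / window >= threshold


# A run averaging 196 passes the Gym criterion but not the stricter 200 one.
rewards = [196.0] * 100
print(is_solved(rewards))                   # threshold 195
print(is_solved(rewards, threshold=200.0))  # threshold 200
```

With a threshold of 200, a policy has to reach the maximum reward in essentially every one of the 100 episodes, which is stricter than what Gym asks for.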

RicardoDominguez commented 6 years ago

Update documentation.