Closed wenyeming333 closed 7 years ago
Hi, I also could not get the algorithm to work on MountainCar -- I would be interested if anyone could get it to work
With OpenAI's distributed ES, it appears that people have been able to get MountainCar working, but not ContinuousMountainCar: https://github.com/openai/evolution-strategies-starter/issues/9
For anyone interested: the main difference between OpenAI's code and mine seems to be the batch size. OpenAI collects a large number of samples (several thousand) before taking a gradient step, and this seems to be necessary for even mildly difficult problems.
Does anyone train the model on MountainCar-v0? I can only obtain the minimum reward -200. I tried both smaller sigma and larger sigma but none of both worked.