Open alanyuwenche opened 10 months ago
I applied the code to train MountainCar-v0 but failed after 10 million timesteps. The command is as follows.
!python ppo.py --gym-id MountainCar-v0 --total-timesteps 10000000
Are these arguments correct? Or is it very hard to train MountainCar-v0 by PPO?
I applied the code to train MountainCar-v0 but failed after 10 million timesteps. The command is as follows.
!python ppo.py --gym-id MountainCar-v0 --total-timesteps 10000000
Are these arguments correct? Or is it very hard to train MountainCar-v0 by PPO?