Closed mgkumar138 closed 5 years ago
Are you clipping your actions to within the allowed range for the action space of the environment? https://github.com/openai/gym/blob/master/gym/envs/classic_control/pendulum.py#L20
Closing due to lack of activity. Possibly related issue: https://github.com/openai/baselines/issues/121
i am testing my AC code using Pendulum-v0 environment and got this error:
gym\envs\classic_control\pendulum.py:88: RuntimeWarning: invalid value encountered in remainder return (((x+np.pi) % (2*np.pi)) - np.pi)
Afterwhich my rewards during training were [nan]: Episode 23 of 100, score [nan]. Would anyone know what might be the cause for this?