Closed tristandeleu closed 6 years ago
HalfCheetah
Ant
gym
NormalizedActionWrapper
NormalizedObservationWrapper
NormalizedRewardWrapper
-v1
xavier_uniform
NormalMLPPolicy
HalfCheetah
andAnt
) now inherit fromgym
NormalizedActionWrapper
,NormalizedObservationWrapper
andNormalizedRewardWrapper
-v1
. The new versions of these environments have normalized actionsxavier_uniform
NormalMLPPolicy
is now a parameter, independent of the input