I am experiencing a problem with version v2020.06.3 training with DDPG+HER on the FetchReach-v1 environment.
Using the provided example, I get an index error:
IndexError: only integers, slices (:), ellipsis (...), numpy.newaxis (None) and integer or boolean arrays are valid indices
which comes from
File (...)/python3.7/site-packages/garage/tf/policies/continuous_mlp_policy.py", line 144, in get_actions observations = self.observation_space.flatten_n(observations)
from what I could see the problem comes from the fact that the OffPolicyVectorizedSampler already flattens the observations at line 113:
input_obses = obs_space.flatten_n(obses)
then, the observation_space used by the ContinuousMLPPolicy is different from the space of the observations received from the sampler.
I am experiencing a problem with version v2020.06.3 training with DDPG+HER on the FetchReach-v1 environment. Using the provided example, I get an index error:
IndexError: only integers, slices (
:), ellipsis (
...), numpy.newaxis (
None) and integer or boolean arrays are valid indices
which comes from
File (...)/python3.7/site-packages/garage/tf/policies/continuous_mlp_policy.py", line 144, in get_actions observations = self.observation_space.flatten_n(observations)
from what I could see the problem comes from the fact that the OffPolicyVectorizedSampler already flattens the observations at line 113:
input_obses = obs_space.flatten_n(obses)
then, the observation_space used by the ContinuousMLPPolicy is different from the space of the observations received from the sampler.