Open rusu24edward opened 3 years ago
Hello,
I observed the same issue with a tuple Gym environment in Ray 1.0.0 using Tensorflow as a framework, with an error in compute_actions, the error is in tf_policy trying to get the shape of fetched. This error does not appear using pytorch.
Reverting to 0.8.5 allows to train an agent with Tensorflow (thanks Edward!).
What is the problem?
I have an environment with a tuple action space. During training, there is no problem. However, when I attempt to demonstrate the learned policy, I get actions of the wrong shape. Like in #3048, I get actions that are 2d arrays instead of the expected 1d arrays.
Ray version and other system information (Python version, TensorFlow version, OS): Ray: 0.8.5 Python: 3.7.7 TF: 2.3.0 OS: Mac 10.14
Reproduction (REQUIRED)
Notice that the printed action is a tuple where the second element is a 2d array instead of a 1d array, just like #3048. This only appears to happen via
agent.compute_action(obs)
and not during training.If we cannot run your script, we cannot fix your issue.