Closed SandervanL closed 1 year ago
Seems like this is because of a gym
update. Downgrading gym
worked, as well as updating the code to expect a second info
returned value on env.reset
, as well as expecting terminated, truncated
instead of done
when calling env.step
Dear,
I encountered the following error when trying to execute your code:
At the error line,
state = ([-0.001 -0.02 -0.023 0.049], {})
is a tuple of a list and a dictionary: , andself.partially_obs_dims = [1, 3]
is a list.I executed the following command:
python policies/main.py -cfg configs/pomdp/cartpole/v/rnn.yml --algo sacd
I am running Python 3.9.7, with the following requirements (installing your conda environment did not work for me on Windows):