Open JankowskiChristopher opened 5 months ago
deepmind/finger-turn_hard also gives similar error.
@adityab @danielpalen I tried using such concatenation
np.concatenate([v.flatten() for v in obs.values()])
in self.prepare_obs(observation)
. It works in terms of not giving errors (tested on cheetah_run), but the rewards are much lower. Maybe this is due to the MultiInputPolicy and therefore code needs to be changed in another place too to make it work.
Hello, I am trying to benchmark your code on more tasks from deepmind/* but they are not working. There seems to be a bug in the
prepare_obs
function insbx/common/policies.py
. I attach stack trace below:Task deepmind/quadruped-run
Task deepmind/humanoid-walk
I believe that this is due to different dict formats of observation returned by
shimmy
. I once had similar problems in another project and fixed them by using the function from TD-MPC2 GitHub repository:Maybe you can try this as well and will work better.