Closed thomaslautenb closed 3 months ago
Hi @thomaslautenb,
Thanks for reporting this.
Which version of Gymnasium are you using? The vector wrapper heavily relies on Gymnasium (and it changed recently in Gymnasium). The last time I ran that code it worked, it was (I think) with Gymnasium 0.28.1.
Now, for the longer term, we are revamping the wrapper implementations to match Gymnasium 1.0. The PR is here: https://github.com/Farama-Foundation/MO-Gymnasium/pull/95 it would be great if we could validate that the new vector wrappers work with MORL-Baselines.
I don't have time to finish this month as I'm defending my thesis and moving abroad but it is on my todo for this summer. If you have time, you could also try it on your own by working from the current PR on MO-Gymnasium and Gymnasium 1.0 :-).
Hi @thomaslautenb,
First, the PR for the migration to 1.0 is moving forward, see #109. Second, I've talked with different people running PGMORL these days and they did not experience any issues. Could you tell me if it is still buggy?
Hi @ffelten excuse me for not getting back to you. I was indeed using an older gymnasium version. Running with 0.28.1 worked. You mgiht close the issue!
Thanks for the feedback!
I encountered a Problem with implementation of the MO_PPO. There was a mismatch in dimensions of the _reward vector in the sync_vector_env environment.
I worked around the issues by extending the dimension of the reward vector by the number of objectives -> reward_dim. (its hardcoded and not pretty)
You might want to have a look on this.
I can provide more extensive report on the issue if required.