Open delara38 opened 1 year ago
Hi @delara38,
Yes, you are right. We have made these changes in the dev branch of the repo, and in fact we are planning to make the return type of the environment a bit more structured with dataclasses. These changes will be integrated into the main branch and the next release in the next 2-3 weeks. For now, if you need the termination/truncation change, please use the dev branch.
great thanks!
Hi,
the gym environments now return a 5-tuple (observation, reward, terminated, truncated, info) from step instead of the previous 4-tuple (observation, reward, done, info); however, RLHive still expects the previous setup at each transition and needs to be amended.
I believe that all that is needed is to change the step function in gym_env.py from
to (assuming that the rest of RLHive will continue to only return a done boolean)
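For illustration, a minimal sketch of the kind of conversion described above. This is not the actual RLHive code: `to_four_tuple` is a hypothetical helper name, and it only assumes that the newer gym `step` returns `(observation, reward, terminated, truncated, info)` and that the caller still wants a single `done` boolean.

```python
def to_four_tuple(step_result):
    """Collapse a 5-tuple step result into the older 4-tuple form.

    Hypothetical helper for illustration only; not part of RLHive or gym.
    """
    observation, reward, terminated, truncated, info = step_result
    # The episode is over if it either terminated or was truncated
    # (for example, by a time limit).
    done = terminated or truncated
    return observation, reward, done, info


# Usage with a hand-written step result standing in for env.step(action):
observation, reward, done, info = to_four_tuple(([0.0, 1.0], 1.0, False, True, {}))
```

Folding `terminated` and `truncated` into one flag loses the distinction between the two, which can matter for bootstrapping value targets, so this is only a stopgap while the rest of the code expects a single `done` flag.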