Closed maxspahn closed 2 years ago
The observation is flattened after it is being checked. So for checking it is not necessary to flatten the observation space as well. As a RL user, do you also need to explicitly access the observation space? If so, you are right and we need to also flatten the observation space.
@alxschwrz
Okay got it. Many RL frameworks (e.g. SB3) check the observations against the initialized observation_space during training. I expect, that an error is thrown during training in that case. If we would like to be compatible with them without further manual changes of RL users, we should also set the correct spaces when initialising the environment. Could also be subject to a further PR though, since there would need to be some more changes to be compatible for RL usage anyway.
Adds the option to flatten the observations from a dict to a plain numpy.ndarray. This could be beneficial for some RL-users.