Closed andrewbwm closed 8 months ago
Fixed in 094d4a1794d174e56aef3cf98067eedf891dc48b.
The reset method now goes through the same function to generate observations, rewards, terminations, truncantions, and infos. Even though it is only expected to make use of observations and infos. However, if a non-viable episode is given it will immediately terminate it, but that might be the desired behaviour anyway.
The attackers' entry point should show in the observation space after a reset. Right now the entry points never make it into any observation.