mal-lang / mal-simulator

Apache License 2.0
2 stars 1 forks source link

Register the Attacker Entry Points as the Observed State After A Reset #11

Closed andrewbwm closed 8 months ago

andrewbwm commented 8 months ago

The attackers' entry point should show in the observation space after a reset. Right now the entry points never make it into any observation.

andrewbwm commented 8 months ago

Fixed in 094d4a1794d174e56aef3cf98067eedf891dc48b.

The reset method now goes through the same function to generate observations, rewards, terminations, truncantions, and infos. Even though it is only expected to make use of observations and infos. However, if a non-viable episode is given it will immediately terminate it, but that might be the desired behaviour anyway.