There seems to be a bug in the step function of the robot_supervisor.py

aidudezzz / deepbots

A wrapper framework for Reinforcement Learning in the Webots robot simulator using Python 3.

GNU General Public License v3.0

236 stars 50 forks source link

if super(Supervisor, self).step(self.timestep) == -1: exit() self.apply_action(action) return ( self.get_observations(), self.get_reward(action), self.is_done(), self.get_info(), )

Hello @RLMilestone, thank you for opening this issue! This is a point of debate for the project since the very beginning. Indeed it seems more natural to step the controller after calling apply_action in the robot-supervisor scheme.

We will have to look into it in depth, because it would be nice for us to do the same in both the robot-supervisor scheme and the emitter-receiver scheme, but in the emitter-receiver scheme it might cause unforeseen issues due to the way the messages are transmitted.

aidudezzz / deepbots

There seems to be a bug in the step function of the robot_supervisor.py #80