Hello, I'm working on ROS KInetic with Ubuntu 16.04.
Anyway, according to here, the distance_rate which corresponds to reward for distance is bigger than 2 in the case of the current goal-robot distance is shorter than absolute goal-distance.
But in environment code, the distance_rate argument is smaller in the upper case; in other words, I have a feeling that it should be modified like,
from distance_rate = 2 ** (current_distance / self.goal_distance)
to distance_rate = 2 ** (self.goal_distance / current_distance)
I think the reward would get bigger and bigger when the robot gets farther from the goal point if it moves following the default code.
Would you happen to teach me why did you set the distance_rate like that?
Hello, I'm working on ROS KInetic with Ubuntu 16.04.
Anyway, according to here, the
distance_rate
which corresponds to reward for distance is bigger than 2 in the case of the current goal-robot distance is shorter than absolute goal-distance.But in environment code, the distance_rate argument is smaller in the upper case; in other words, I have a feeling that it should be modified like,
from
distance_rate = 2 ** (current_distance / self.goal_distance)
todistance_rate = 2 ** (self.goal_distance / current_distance)
I think the reward would get bigger and bigger when the robot gets farther from the goal point if it moves following the default code.
Would you happen to teach me why did you set the
distance_rate
like that?Thanks in advance. :)