[Bugfix] Prevent cumulative reward from exceeding 1.0 in continuous mode.

google-research / ravens

Train robotic agents to learn pick and place with deep learning for vision-based manipulation in PyBullet. Transporter Nets, CoRL 2020.

https://transporternets.github.io

Apache License 2.0

562 stars 97 forks source link

[Bugfix] Prevent cumulative reward from exceeding 1.0 in continuous mode. #12

Closed kevinzakka closed 3 years ago

kevinzakka commented 3 years ago

This corrects an error I made in returning > 0 reward after the task was complete but the robot was still executing the last movements in a plan (e.g. postplace pose, home pose). The reward is now correctly capped at 1.0 for all tasks.