reiniscimurs / DRL-robot-navigation

Deep Reinforcement Learning for mobile robot navigation in ROS Gazebo simulator. Using Twin Delayed Deep Deterministic Policy Gradient (TD3) neural network, a robot learns to navigate to a random goal point in a simulated environment while avoiding obstacles.
MIT License

Simulation environment problem #132

Closed pzhabc closed 2 months ago

pzhabc commented 2 months ago

Hello, thank you for sharing. I created my own simulation environment and completed network training in it. I then created a separate test environment, but I found that the robot simply couldn't navigate to the goal point; it did not perform as well as a model trained in your simulation environment. Is this because my simulation environment is too simple (fixed obstacles, no randomness)? Is the design of the simulation environment critical?

reiniscimurs commented 2 months ago

Hi,

Randomness helps quite a lot in training. It is unclear to me whether the different world was used in training as well, or only in testing. In any case, I suggest checking whether there are major differences in how the state is represented. For example, if your training world is small, let's say 5x5 meters, your sensor readings during training will never have exceeded 5 meters, and the model will have learned only from values in that range. If you then put that model in a 10x10 world, it would get confused. Beyond that, it is difficult to say anything without knowing what your worlds look like.
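One way to make the state representation consistent across worlds of different sizes is to clip laser readings to a fixed maximum range and normalize them before feeding them to the policy. The sketch below is purely illustrative; the names (`MAX_SENSOR_RANGE`, `normalize_scan`) are hypothetical and not taken from this repository.

```python
import numpy as np

# Hypothetical cap on sensor readings, in meters. If the same cap is used
# in every world, a normalized value of e.g. 0.5 always means "obstacle at
# half the sensor horizon", regardless of whether the world is 5x5 m or 10x10 m.
MAX_SENSOR_RANGE = 10.0

def normalize_scan(ranges, max_range=MAX_SENSOR_RANGE):
    """Clip raw laser ranges to a fixed maximum and scale them into [0, 1]."""
    ranges = np.asarray(ranges, dtype=np.float32)
    clipped = np.minimum(ranges, max_range)  # readings beyond the cap look identical
    return clipped / max_range               # scale into [0, 1]

# A scan from a small training world and one from a larger test world now
# map into the same value range, so the policy sees consistent inputs.
small_world = normalize_scan([1.0, 4.9, 2.5])
large_world = normalize_scan([1.0, 9.7, 2.5])
```

With this kind of preprocessing, a reading at the same distance produces the same state value in both worlds, which removes one common source of train/test mismatch.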