Results from the training process

reiniscimurs / DRL-robot-navigation

Deep Reinforcement Learning for mobile robot navigation in ROS Gazebo simulator. Using Twin Delayed Deep Deterministic Policy Gradient (TD3) neural network, a robot learns to navigate to a random goal point in a simulated environment while avoiding obstacles.

MIT License

486 stars 97 forks source link

Results from the training process #104

Closed 1390806607 closed 2 months ago

1390806607 commented 5 months ago

During the training process, I found that the car has been in a circle, this is what happened

reiniscimurs commented 5 months ago

Hi,

See the discussions on convergence: https://github.com/reiniscimurs/DRL-robot-navigation/issues/103