Shuijing725 / CrowdNav_DSRNN

[ICRA 2021] Decentralized Structural-RNN for Robot Crowd Navigation with Deep Reinforcement Learning
https://sites.google.com/illinois.edu/crowdnav-dsrnn/home
MIT License
97 stars 23 forks source link

2 agent scenarios #8

Closed evantancy closed 2 years ago

evantancy commented 3 years ago

Hi I was wondering if it's possible to spawn a single human and a single robot for certain scenarios to evaluate side preference for passing, overtaking and crossing? I was trying to do that using the example model and it seems that the robot does well in approaching the goal but lingers around the last 0.5m of its goal. I've set sim.human_num = 1 and spawned the robot and human directly opposite on a circle for passing.

Shuijing725 commented 3 years ago

Yes, it is possible. You can just change the human number and starting & goal positions of each agent in the simulator.

It sounds like you are in the right direction, perhaps the model just needs more training or some hyperparameter tuning (including reward shaping) to get rid of the lingering behavior.

evantancy commented 3 years ago

Thanks for the recommendation. Any idea which values I should start changing? I noticed that the difference between DSRNN and other papers is that DSRNN's reward function has much higher magnitudes.

Shuijing725 commented 3 years ago

You can try things out and do experiments to test your hypothesis. In fact, we could have normalized the reward to [-1, 1]. Feel free to try this.