Acmece / rl-collision-avoidance

Implementation of the paper "Towards Optimally Decentralized Multi-Robot Collision Avoidance via Deep Reinforcement Learning"
https://arxiv.org/abs/1709.10082
326 stars 92 forks source link

Stage 2 error #22

Open Balajinatesan opened 3 years ago

Balajinatesan commented 3 years ago

Env 03, Goal (-07.0, 009.5), Episode 00000, setp 097, Reward 12.6 , Reach Goal, Env 04, Goal (-12.5, 004.0), Episode 00000, setp 052, Reward -33.4, Crashed, Env 00, Goal (-18.0, 011.5), Episode 00000, setp 110, Reward 13.0 , Reach Goal, Env 01, Goal (-18.0, 009.5), Episode 00000, setp 095, Reward 12.9 , Reach Goal, Env 05, Goal (-12.5, 017.0), Episode 00000, setp 081, Reward -28.1, Crashed, Env 02, Goal (-07.0, 011.5), Episode 00000, setp 044, Reward -30.0, Crashed, Traceback (most recent call last): File "ppo_stage2.py", line 212, in run(comm=comm, env=env, policy=policy, policy_path=policy_path, action_bound=action_bound, optimizer=opt) File "ppo_stage2.py", line 120, in run obs_size=OBS_SIZE, act_size=ACT_SIZE) File "/home/balaji/rover_ws/src/rl-collision-avoidance/model/ppo.py", line 204, in ppo_update_stage2 obss = obss.reshape((num_step*num_env, frames, obs_size)) ValueError: cannot reshape array of size 1966080 into shape (5632,3,512) Hi, I got this error when I trained the second stage mpiexec -np 44 python ppo_stage2.py Do I need to change anything to train the model. and also I finished the training for the first model can you guys give me an idea of how to implement the code on the real robot. If any of you guys have any idea please share with me here or at Gmail(balajinatesan30@gmail.com) you can mail me. This might be a great help for us to get succeed. @Acmece @BingHan0458

zzhangcy commented 1 month ago

Hello, I also have this problem. Did you solve it? thanks for your replay.