Acmece / rl-collision-avoidance

Implementation of the paper "Towards Optimally Decentralized Multi-Robot Collision Avoidance via Deep Reinforcement Learning"
https://arxiv.org/abs/1709.10082
326 stars 92 forks source link

Hyperparmeters values are different from the paper #6

Closed afakharany93 closed 4 years ago

afakharany93 commented 4 years ago

Hello, I have noticed that there is a difference in some hyper-parameters values between this implementation and the paper, is there a justification for that? For an example the coefficient multiplied by the value loss in the paper is 1.0 while in your implementation it's 20.0. Also I would like to ask if this is the official repo for the paper or is it your attempt at reimplementing the paper? Thanks and enjoy the holidays

Acmece commented 4 years ago

Hi, This is not the official repo for the paper. I reimplemented it using the clipped surrogate objective instead of KL divergence.