chainer / chainerrl

ChainerRL is a deep reinforcement learning library built on top of Chainer.
MIT License

Synchronous parallel training #150

Open muupan opened 6 years ago

muupan commented 6 years ago

ChainerRL supports asynchronous parallel training such as A3C, but it does not yet support synchronous parallel training, where multiple actors interact with their own environments in lockstep. Synchronous training is beneficial in that it is more stable and can utilize GPU computation. It can also be used for multi-agent environments with simultaneous actions.

Since most RL algorithms can naturally support it, we should define a common interface for it.
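
To make the idea concrete, here is a minimal sketch of what such a common interface could look like. It is only an illustration under stated assumptions, not ChainerRL's API: `CountingEnv`, `SyncVectorEnv`, `RandomBatchAgent`, `batch_act`, and `run_sync` are all hypothetical names. The point is that the agent receives one observation per actor and returns one action per actor in a single call, which is where a batched forward pass on the GPU would go.

```python
import random


class CountingEnv:
    """Toy environment (hypothetical): an episode ends after `length` steps."""

    def __init__(self, length):
        self.length = length
        self.t = 0

    def reset(self):
        self.t = 0
        return self.t

    def step(self, action):
        self.t += 1
        reward = float(action)
        done = self.t >= self.length
        return self.t, reward, done


class SyncVectorEnv:
    """Steps several environments in lockstep (hypothetical interface)."""

    def __init__(self, envs):
        self.envs = envs

    def reset(self):
        return [env.reset() for env in self.envs]

    def step(self, actions):
        obs, rewards, dones = [], [], []
        for env, action in zip(self.envs, actions):
            o, r, d = env.step(action)
            if d:
                # Restart finished episodes so the batch always stays full.
                o = env.reset()
            obs.append(o)
            rewards.append(r)
            dones.append(d)
        return obs, rewards, dones


class RandomBatchAgent:
    """Picks one action per observation in a single call (hypothetical)."""

    def __init__(self, n_actions, seed=0):
        self.n_actions = n_actions
        self.rng = random.Random(seed)

    def batch_act(self, batch_obs):
        # A real agent would run one batched forward pass here.
        return [self.rng.randrange(self.n_actions) for _ in batch_obs]


def run_sync(agent, vec_env, n_steps):
    """Run all actors synchronously for n_steps; return per-actor reward sums
    and the number of episodes that finished."""
    batch_obs = vec_env.reset()
    total_rewards = [0.0] * len(batch_obs)
    episodes = 0
    for _ in range(n_steps):
        actions = agent.batch_act(batch_obs)
        batch_obs, rewards, dones = vec_env.step(actions)
        total_rewards = [t + r for t, r in zip(total_rewards, rewards)]
        episodes += sum(dones)
    return total_rewards, episodes
```

With this shape, the same loop could also drive a multi-agent environment with simultaneous actions by treating each actor's slot in the batch as one agent, though that is a design assumption rather than something settled here.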

muupan commented 6 years ago

Related PRs: https://github.com/chainer/chainerrl/pull/149 https://github.com/chainer/chainerrl/pull/295