Closed jmkim0309 closed 3 years ago
Hey Kim,
That is a very good question. Unfortunately, I don't have a good answer. The truth is that what makes RL work (or not) is a very delicate balance of hyperparameters, update frequencies, and randomness. Papers in the field sometimes include a good chunk of theory, but the actual implementations are only theory-inspired, and the real proof of their performance comes from testbeds like Atari.
This is certainly a worthwhile, if daunting, topic to get into for junior researchers. Currently, we don't understand this topic very well at all.
Hi @heiner,
Thank you for your kind reply to the previous issue (https://github.com/facebookresearch/torchbeast/issues/25). As I understand it, I need to use polybeast to reproduce the SpaceInvaders results.
But could you please elaborate a little bit more on the following: "The MonoBeast version you are using has the upside of being simpler to install and run, but uses a different design that impacts RL performance in hard to understand ways".
I assume that polybeast trains much faster than monobeast, but what exactly is the reason for the score gap between the two? E.g. better exploration at the early stage of training, less policy lag during environment interaction, ...
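To make the "policy lag" part of the question concrete, here is a minimal, hypothetical sketch (not TorchBeast's actual code) of an actor-learner loop. Each trajectory is tagged with the policy version the actor used; the lag is how many learner updates happened before the learner consumed that trajectory. The `simulate` function and its parameters are illustrative assumptions, not anything from the repository.

```python
# Hypothetical illustration of policy lag in an asynchronous actor-learner
# setup (not TorchBeast code). Actors tag trajectories with the policy
# version they acted under; the learner consumes trajectories FIFO.
import collections


def simulate(num_updates, actor_throughput, batch_size):
    """Return the policy lag of every trajectory the learner consumes.

    actor_throughput: trajectories the actors produce per learner update.
    If actors outpace the learner, the queue grows and old trajectories
    are learned from long after they were generated.
    """
    learner_version = 0
    queue = collections.deque()
    lags = []
    for _ in range(num_updates):
        # Actors produce trajectories tagged with the current policy version.
        for _ in range(actor_throughput):
            queue.append(learner_version)
        # Learner consumes one batch (oldest trajectories first) and updates.
        if len(queue) >= batch_size:
            batch = [queue.popleft() for _ in range(batch_size)]
            lags.extend(learner_version - v for v in batch)
            learner_version += 1
    return lags


# When producer and consumer throughput match, lag stays at zero;
# when actors outpace the learner, lag grows without bound.
lags_matched = simulate(num_updates=50, actor_throughput=1, batch_size=1)
lags_fast_actors = simulate(num_updates=50, actor_throughput=2, batch_size=1)
```

This is only a toy model, but it shows why architectural details (how trajectories are queued and batched, and how fast inference runs relative to learning) can change the effective off-policyness of the data, which correction schemes like V-trace then have to compensate for.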