As we do in the coupled version of SAC, where we gather data from all ranks before running a distributed update, we should add the possibility for all Dreamer's implementaation (P2E included) to gather data from all ranks before running a distributed update. We could also let the user choose the preferred behavior.
As we do in the coupled version of SAC, where we gather data from all ranks before running a distributed update, we should add the possibility for all Dreamer's implementaation (P2E included) to gather data from all ranks before running a distributed update. We could also let the user choose the preferred behavior.
cc @michele-milesi